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Field of the Invention 

[0001] This application claims the benefit of U.S. Provisional Application No. 60/033,381, 
filed Dec. 16, 1996. The invention relates to the molecular modification of gymnosperms in 
order to cause the production of syringyl units during lignin biosynthesis and to production 
and propagation of gymnosperms containing syringyl lignin. 
Background of the Invention 

[0002] Lignin is a major part of the supportive structure of most woody plants including 
angiosperm and gymnosperm trees which in turn are the principal sources of fiber for making 
paper and cellulosic products. In order to liberate fibers from wood structure in a manner 
suitable for making many grades of paper, it is necessary to remove much of the lignin from 
the fiber/lignin network. Lignin is removed from wood chips by treatment of the chips in an 
alkaline solution at elevated temperatures and pressure in an initial step of papermaking 
processes. The rate of removal of lignin from wood of different tree species varies depending 
upon lignin structure. Three different lignin structures have been identified in trees: p- 
hydroxyphenyl, guaiacyl and syringyl, which are illustrated in FIG. 1 . 
[0003] Angiosperm species, such as Liquidambar styraciflua L. [sweetgum], have lignin 
composed of a mixture of guaiacyl and syringyl monomer units. In contrast, gymnosperm 
species such as Pinus taeda L. [loblolly pine] have lignin which is devoid of syringyl 
monomer units. Generally speaking, the rate of delignification in a pulping process is directly 
proportional to the amount of syringyl lignin present in the wood. The higher delignification 
rates associated with species having a greater proportion of syringyl lignin result in more 
efficient pulp mill operations since the mills make better use of energy and capital investment 
and the environmental impact is lessened due to a decrease in chemicals used for 
delignification. 
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[0004] It is therefore an object of the invention to provide gymno sperm species which are 
easier to delignify in pulping processes. 

[0005] Another object of the invention is to provide gymnosperm species such as loblolly 
pine which contain syringyl lignin. 

[0006] An additional object of the invention is to provide a method for modifying genes 
involved in ligning lignin biosynthesis in gymnosperm species so that production of syringyl 
lignin is increased while production of guaiacyl lignin is suppressed. 
[0007] Still another object of the invention is to produce whole gymnosperm plants 
containing genes which increase production of syringyl lignin and repress production of 
guaiacyl lignin. 

[0008] Yet another object of the invention is to identify, isolate and/or clone those genes in 
angiosperms responsible for production of syringyl lignin. 

[0009] A further object of the invention is to provide, in gymnosperms, genes which 
produce syringyl lignin. 

[0010] Another object of the invention is to provide a method for making an expression 
cassette insertable into a gymnosperm cell for the purpose of inducing formation of syringyl 
lignin in a gymnosperm plant derived from the cell. 
Definitions 

[0011] The term "promoter" refers to a DNA sequence in the 5' flanking region of a given 
gene which is involved in recognition and binding of RNA polymerase and other 
transcriptional proteins and is required to initiate DNA transcription in cells. 
[0012] The term "constitutive promoter" refers to a promoter which activates transcription 
of a desired gene, and is commonly used in creation of an expression cassette designed for 
preliminary experiments relative to testing of gene function. An example of a constitutive 
promoter is 35S CaMV, available from Clonetech. 
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[0013] The term "expression cassette" refers to a double stranded DNA sequence which 
contains both promoters and genes such that expression of a given gene is acheived upon 
insertion of the expression cassette into a plant cell. 

[0014] The term "plant" includes whole plants and portions of plants, including plant 
organs (e.g. roots, stems, leaves, etc.) 

[0015] The term "angiosperm" refers to plants which produce seeds encased in an ovary. A 
specific example of an angiosperm is Liquidambar styraciflua (L.)[sweetgum]. The 
angiosperm sweetgum produces syringyl lignin. 

[0016] The term "gymnosperm" refers to plants which produce naked seeds, that is, seeds 
which are not encased in an ovary. A specific example of a gymnosperm is Pinus taeda 
(L7)Q£loblolly pine]. The gymnosperm loblolly pine does not produce syringyl lignin. 
Summary of the Invention 

[0017] With regard to the above and other objects, the invention provides a method for 
inducing production of syringyl lignin in gymnosperms and to gymnosperms which contain 
syringyl lignin for improved delignification in the production of pulp for papermaking and 
other applications. In accordance with one of its aspects, the invention involves cloning an 
angiosperm DNA sequence which codes for enzymes involved in production of syringyl 
lignin monomer units, fusing the angiosperm DNA sequence to a lignin promoter region to 
form an expression cassette, and inserting the expression cassette into a gymnosperm 
genome. 

[0018] Enzymes required for production of syringyl lignin in an angiosperm are obtained 
by deducing an amino acid sequence of the enzyme, extrapolating an mRNA sequence from 
the amino acid sequence, constructing a probe for the corresponding DNA sequence and 
cloning the DNA sequence which codes for the desired enzyme. A promoter region specific 
to a gymnosperm lignin biosynthesis gene is identified by constructing a probe for a 
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gymnosperm lignin biosynthesis gene, sequencing the 5' flanking region of the DNA which 
encodes the gymnosperm lignin biosynthesis gene to locate a promoter sequence, and then 
cloning that sequence. 

[0019] An expression cassette is constructed by fusing the angiosperm syringyl lignin DNA 
sequence to the gymnosperm promoter DNA sequence. Alternatively, the angiosperm 
syringyl lignin DNA is fused to a constitutive promoter to form an expression cassette. The 
expression cassette is inserted into the gymnosperm genome to transform the gymnosperm 
genome. Cells containing the transformed genome are selected and used to produce a 
transformed gymnosperm plant containing syringyl lignin. 

[0020] In accordance with the invention, the angiosperm gene sequences bi-OMT, 4CL, 
FA5H P450 -1 and FA5H P-450 -2 have been determined and isolated as associated with 
production of syringyl lignin in sweetgum and lignin promoter regions for the gymnosperm 
loblolly pine have been determined to be the 5' flanking regions for the 4CL1B, 4CL3B and 
PAL gymnosperm lignin genes. Expression cassettes containing sequences of selected genes 
from sweetgum have been inserted into loblolly pine embryogenic cells and presence of 
sweetgum genes associated with production of syringyl lignin has been confirmed in daughter 
cells of the resulting loblolly pine embryogenic cells. 

[0021] The invention therefore enables production of gymnosperms such as loblolly pine 
containing genes which code for production of syringyl lignin, to thereby produce in such 
species syringyl lignin in the wood structure for enhanced pulpability. 
Brief Description of the Drawings 

[0022] The above and other aspects of the invention will now be further described in the 
following detailed specification considered in conjunction with the following drawings in 
which: 

[0023] FIG. 1 illustrates a generalized pathway for lignin synthesis; and 

4 



WASH_1 634660.1 



[0024] FIG. 2 illustrates FIGS. 2A-2E illustrate a bifunctional-O-methyl transferase (bi- 
OMT) gene sequence involved in the production of syringyl lignin in an angiosperm (SEQ ID 
3 5 coding SEP ID 6 ); 

[0025] FIG. 3 illustrates FIGS. 3A-3G illustrate a 4-coumarate CoA ligase ( 4CL) gene 
sequence involved in the production of syringyl lignin in an angiosperm (SEQ ID 4 7 coding 
SEP ID 8 ); 

[0026] FIG. 4 illustrates a ferulic acid-5-hydroxylase ( FA5H P450 -1) gene sequence 
involved in the production of syringyl lignin in an angiosperm (SEQ ID 1 coding SEP ID 2) ; 
[0027] FIG. 5 illustrates a ferulic acid-5-hydroxylase ( TA5H P450 -2) gene sequence 
involved in the production of syringyl lignin in an angiosperm (SEQ ID S3 coding SEP ID 
4); 

[0028] FIG. 6 illustrates nucleotide sequences of the 5' flanking region of the loblolly pine 
4CL1B gene showing the location of regulatory elements for lignin biosynthesis (SEQ ID 
610); 

[0029] FIG. 7 illustrat e s FIGS 7A-7B illustrate nucleotide sequences of the 5' flanking 
region of the loblolly pine 4CL3B gene showing the location of regulatory elements for lignin 
biosynthesis (SEQ ID ?!!); 

[0030] FIG. 8 illustrat e s FIGS. 8A-8B illustrate nucleotide sequences of the 5' flanking 
region of loblolly pine PAL gene showing the location of regulatory elements for lignin 
biosynthesis (SEQ ID S9); 

[0031] FIG. 9 illustrates a PCR confirmation of the sweetgum FA5H P450 -1 gene sequence 
in transgenic loblolly pine cells; and 
Detailed Descrioption of the Invention 

[0032] In accordance with the invention, a method is provided for modifying a 
gymnosperm genome, such as the genome of a loblolly pine, so that syringyl lignin will be 

5 

WASH..1 634660.1 



produced in the resulting plant, thereby enabling cellulosic fibers of the same to be more 
easily separated from lignin in a pulping process. In general, this is accomplished by fusing 
one or more angiosperm DNA sequences (referred to at times herein as the "ASL DNA 
sequences") which are involved in production of syringyl lignin to a gymnosperm lignin 
promoter region (referred to at times herein as the "GL promoter region") specific to genes 
involved in gymnosperm lignin biosynthesis to form a gymnosperm syringyl lignin 
expression cassette (referred to at times herein as the "GSL expression cassette"). 
Alternatively, the one or more ASL DNA sequences are fused to one or more constitutive 
promoters to form a GSL expression cassette. 

[0033] The GSL expression cassette preferably also includes selectable marker genes which 
enable transformed cells to be differentiated from untransformed cells. The GSL expression 
cassette containing selectable marker genes is inserted into the gymnosperm genome and 
transformed cells are identified and selected, from which whole gymnosperm plants may be 
produced which exhibit production of syringyl lignin. 

[0034] To suppress production of less preferred forms of lignin in gymnosperms, such as 
guaiacyl lignin, genes from the gymnosperm associated with production of these less 
preferred forms of lignin are identified, isolated and the DNA sequence coding for anti-sense 
mRNA (referred to at times herein as the "GL anti-sense sequence") for these genes is 
produced. The DNA sequence coding for anti-sense mRNA is then incorporated into the 
gymnosperm genome, which when expressed bind to the less preferred guaiacyl gymnosperm 
lignin mRNA, inactivating it. 

[0035] Further features of these and various other steps and procedures associated with 
practice of the invention will now be described in more detail beginning with identification 
and isolation of ASL DNA sequences of interest for use in inducing production of syringyl 
lignin in a gymnosperm. 
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I. Determination Of DNA Sequence For Genes Associated With Production Of Syringyl 
Lignin 

[0036] The general biosynthetic pathway for production of lignin has been postulated as 
shown in FIG. 1. From FIG. 1, it can be seen that the genes CCL, OMT and F5H (which is 
from the class of P450 genes) may play key roles in production of syringyl lignin in some 
plant species, but their specific contributions and mechanisms remain to be positively 
established. It is suspected that the CCL, OMT and F5H genes may have specific equivalents 
in a specific angiosperm, such as sweetgum. Accordingly, one aim of the present invention is 
to identify, sequence and clone specific genes of interest from an angiosperm such as 
sweetgum which are involved in production of syringyl lignin and to then introduce those 
genes into the genome of a gymnosperm, such as loblolly pine, to induce production of 
syringyl lignin. 

[0037] Genes of interest may be identified in various ways, depending on how much 
information about the gene is already known. Genes believed to be associated with 
production of syringyl lignin have already been sequenced from a few angiosperm species, 
viz, CCL and OMT. 

[0038] DNA sequences of the various CCL and OMT genes are compared to each other to 
determine if there are conserved regions. Once the conserved regions of the DNA sequences 
are identified, oligo-dT primers homologous to the conserved sequences are synthesized. 
Reverse transcription of the DNA-free total RNA which was purified from sweetgum xylem 
tissue, followed by double PCR using gene-specific primers, enables production of probes for 
the CCL and OMT genes. 

[0039] A sweetgum cDNA library is constructed in a host, such as lambda ZAPII, available 
from Stratagene, of LaJolla, Calif, using poly(A)+RNA isolated from sweetgum xylem, 
according to the methods described by Bugos et al. (1995 Biotechniques 19:734-737). The 
above mentioned probes are used to assay the sweetgum cDNA library to locate cDNA which 
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codes for enzymes involved in production of syringyl lignin. Once a syringyl lignin sequence 
is located, it is then cloned and sequenced according to known methods which are familiar to 
those of ordinary skill. 

[0040] In accordance with the invention, two sweetgum syringyl lignin genes have been 
determined using the above-described technique. These genes have been designated 4CL and 
bi-OMT. The sequence obtained for the sweetgum syringyl lignin gene, designated bi-OMT, 
is illustrated in FIG. 2 (SEQ ID 3 -5 and 6 ). The sequence obtained for the sweetgum syringyl 
lignin gene, designated 4CL, is illustrated in FIG. 3 f SEO ID 4 SEP ID 7 and 8 ). 
[0041] An alternative procedure was employed to identify the F5H equivalent genes in 
sweetgum. Because the DNA sequences for similar P450 genes from other plant species were 
known, probes for the P450 genes were designed based on the conserved regions found by 
comparing the known sequences for similar P450 genes. The known P450 sequences used for 
comparison include all plant P450 genes in the GenBank database. Primers were designed 
based on two highly conserved regions which are common to all known plant P450 genes. 
The primers were then used in a PCR reaction with the sweetgum cDNA library as a 
template. Once P450-like fragments were located, they were amplified using standard PCR 
techniques, cloned into a pBluescript vector available from Clonetech of Palo Alto, Calif, and 
transformed into a DH5. alpha. E. coli strain available from Gibco BRL of Gaithersburg, Md. 
[0042] After E. coli colonies were tested in order to determine that they contained the 
P450-like DNA fragments, the fragments were sequenced. Several P450-like sequences were 
located in sweetgum using the above described technique. One P450-like sequence was 
sufficiently different from other known P450 sequences to indicate that it represented a new 
P450 gene family. This potentially new P450 cDNA fragment was used as a probe to screen a 
full length clone from the sweetgum xylem library. This -These p utative hydroxylase clon e 
wa r, d es ignated FA5H P450-i clones were designated P450-1 and P450-2 . The sequence 
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obtained for FA5W -P450-1 and P450-2 are i s-illustrated in FIG. 4 (SEQ ID 4 -1 and 2) and 
FIG. 5 (SEP ID 3 and 4) . 

II. Identification Of GL Gene Promoter Regions 

[0043] In order to locate gymnosperm lignin promoter regions, probes are developed to 
locate lignin genes. After the-gymnosperm lignin gene is located, the portion of DNA 
upstream from the gene is sequenced, preferably using the Genome Walker Kit, available 
from Clonetech. The portion of DNA upstream from the lignin gene will generally contain 
the gymnosperm lignin promoter region. 

[0044] Gymnosperm genes of interest include CCL-like genes and PAL-like genes, which 
are beleived to be involved in the production of lignin in gymnosperms. Preferred probe 
sequences are developed based on previously sequenced genes, which are available from thea 
gene bank. The preferred gene bank accession numbers for the CCL-like genes include 
U39404 and U39405. A preferred gene bank accession number for a PAL-like gene is 
U39792. Probes for such genes are constructed according to methods familiar to those of 
ordinary skill in the art. A genomic DNA library is constructed and DNA fragments which 
code for gymnosperm lignin genes are then identified using the above mentioned probes. A 
preferred DNA library is obtained from the gymnosperm, Pinus taeda (L.)[Loblolly Pine], 
and a preferred host of the genomic library is Lambda Dashll, available from Stratagene of 
LaJolla, Calif. 

[0045] Once the DNA fragments which code for the gymnosperm lignin genes are located, 
the genomic region upstream from the gymnosperm lignin gene (the 5' flanking region) was 
identified. This region contains the GL promoter. Three promoter regions were located from 
gymnosperm lignin biosynthesis genes. The first is the 5' flanking region of the loblolly pine 
4CL1B gene, shown in FIG. 6 (SEQ ID 610). The second is the 5* flanking region of the 
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loblolly pine gene 4CL3B, shown in FIG. 7 (SEQ ID 1U). The third is the 5' flanking region 
of the loblolly pine gene PAL, shown in FIG. 8 (SEQ ID $9). 

III. Fusing The GL Promoter Region To The ASL DNA Sequence 

[0046] The next step of the process is to fuse the GL promoter region to the ASL DNA 
sequence to make a GSL expression cassette for insertion into the genome of a gymnosperm. 
This may be accomplished by standard techniques. In a preferred method, the GL promoter 
region is first cloned into a suitable vector. Preferred vectors are pGEM7Z, available from 
Promega, Madison, Wis. and SK available from Stratagene, of LaJolla, Calif. After the 
promoter sequence is cloned into the vector, it is then released with suitable restriction 
enzymes. The ASL DNA sequence is released with the same restriction enzyme(s) and 
purified. 

[0047] The GL promoter region sequence and the ASL DNA sequence are then ligated such 
as with T4 DNA ligase, available from Promega, to form the GSL expression cassette. Fusion 
of the GL and ASL DNA sequence is confirmed by restriction enzyme digestion and DNA 
sequencing. After confirmation of GL promoter-ASL DNA fusion, the GSL expression 
cassette is released from the original vector with suitable restriction enzymes and used in 
construction of vectors for plant transformation. 

IV. Fusing The ASL DNA Sequence to a Constitutive Promoter Region 

[0048] In an alternative embodiment, a standard constitutive promoter may be fused with 
the ASL DNA sequence to make a GSL expression cassette. For example, a standard 
constitutive promoter may be fused with FA5H P450 -1 to form an expression cassette for 
insertion of FA5H P450 -1 sequences into a gymnosperm genome. In addition, a standard 
constitutive promoter may be fused with FA5H P450 -2 to form an expression cassette for 
insertion of FA5H P450 -2 into a gymnosperm genome. A constitutive promoter for use in the 
invention is the double 35S promoter, available from Clonetech. 
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[0049] In the preferred practice of the invention using constitutive promoters, a suitable 
vector such as pBipBI221 5 is digested Xbal and Hindlll to release the 35 S promoter. At the 
same time the vector pHygro, available from International Paper, was disgested by Xbal and 
Hindlll to release the double 35S promoter. The double 35S promoter was ligated to the 
previously digested pBipBI221 vector to produce a new pBipBI221 with the double 35S 
promoter. This new pBipBI221 was digested with Sad and Smal, to release the GUS 
fragment. The vector is next treated with T4 DNA polymerase to produce blunt ends and the 
vector is self-ligated. This vector is then further digested with BamHI and Xbal, available 
from Promega. After the pBipBI221 vector containing the constitutive promoter region has 
been prepared, lignin gene sequences are prepared for insertion into the pBipBI221 vector. 
[0050] The coding regions of sweetgum FA5H P450 -1 or FA5H P450 -2 are amplified by 
PCR using primer with restriction sites incorporated in the 5' and 3' ends. In one example, an 
Xbal site was incorporated at the 5' end and a BamB HBamnHI site was incorporated at the 3' 
end of the sweetgum FA5H P450 -1 or FA5H P450 -2 genes. After PCR, the FA5H P450 -1 and 
FA5H P450 -2 genes were separately cloned into a TA vector available from Invitrogen. The 
TA vectors containing the FA5H P450 -1 and FA5H P450 -2 genes, respectively, were digested 
by Xbal and BamHI to release the FA5H P450 -1 or FA5H P450 -2 sequences. 
[0051] The p35SS vector, described above, and the isolated sweetgum FA5H P450 -1 or 
FA5H P450 -2 fragments were then ligated to make GLS expression cassettes containing the 
con s tiutiv e constitutive p romoter. 

V. Inserting the Expression Cassette into the Gvmnosperm Genome 
[0052] There are a number of methods by which the GSL expression cassette may be 
inserted into a target gymnosperm cell. One method of inserting the expression cassette into 
the gymnosperm is by micro-projectile bombardment of gymnosperm cells. For example, 
embryogenic tissue cultures of loblolly pine may be initiated from immature zygotic 
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embryos. Tissue is maintained in an undifferentiated state on semi-solid proliferation 
medium. For transformation, embryogenic tissue is s; suspended in liquid proliferation 
medium. Cells are then sieved through, a preferably 40 mesh screen, to separate small, 
densely cytoplasmic cells from large vacuolar cells. 

[0053] After separation, a portion of the liquid cell suspension fraction is vacuum deposited 
onto filter paper and placed on semi-solid proliferation medium. The prepared gymnosperm 
target cells are then grown for several days on filter paper discs in a petri dish. 
[0054] A 1 : 1 mixture of plasmid DNA containing the selectable marker expression cassette 
and plasmid DNA containing the FA5H P450 -1 expression cassette may be precipitated with 
gold to form microprojectiles. The microprojectiles are rinsed in absolute ethanol and aliqots 
are dried onto a suitable macrocarrier such as the macrocarrier available from BioRad in 
Hercules, Calif. 

[0055] Prior to bombardment, embryogenic tissue is preferably desiccated under a sterile 
laminar-flow hood. The desiccated tissue is transferred to semi-solid proliferation medium. 
The prepared microprojectiles are accelerated from the macrocarrier into the desiccated target 
cells using a suitable apparatus such as a BioRad PDS-1000/HE particle gun. In a preferred 
method, each plate is bombarded once, rotated 1 80 degrees, and bombarded a second time. 
Preferred bombardment parameters are 1350 psi rupture disc pressure, 6 mm distance from 
the rupture disc to macrocarrier (gap distance), 1 cm macrocarrier travel distance, and 10 cm 
distance from macrocarrier stopping screen to culture plate (microcarrier travel distance). 
Tissue is then transferred to semi-solid proliferation medium containing a selection agent, 
such as hygromycin B, for two days after bombardment. 

[0056] Other methods of inserting the GSL expression cassette include use of silicon 
carbide whiskers, transformed protoplasts, Agrobacterium vectors and electroporation. 
VI. Identifying Transformed Cells 
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[0057] In general, insertion of the GSL expression cassette will typically be carried out in a 
mass of cells and it will be necessary to determine which cells harbor the recombinant DNA 
molecule containing the GSL expression cassette. Transformed cells are first identified by 
their ability to grow vigorously on a medium containing an antibiotic which is toxic to non- 
transformed cells. Preferred antibiotics are kanamycin and hygromycin B. Cells which grow 
vigorously on antibiotic containing medium are further tested for presence of either portions 
of the plasmid vector, the syringyl lignin genes in the GSL expression cassette; e.g. the 
angiosperm bi-OMT, 4CL, FA5H P450 -1 or FA5H P450 -2 gene, or by testing for presence of 
other fragments in the GSL expression cassette. Specific methods which can be used to test 
for presence of portions of the GSL expression cassette include Southern blotting with a 
labeled complementary probe or PCR amplification with specific complementary primers. In 
yet another approach, an expressed syringyl lignin enzyme can be detected by Western 
blotting with a specific antibody, or by assaying for a functional property such as the 
appearance of functional enzymatic activity. 

VII. Production of a Gvmnosperm Plant from the Transformed Gvmnosperm Cell 
[0058] Once transformed embryogenic cells of the gymnosperm have been identified, 
isolated and multiplied, they may be grown into plants. It is expected that all plants resulting 
from transformed cells will contain the GSL expression cassette in all their cells, and that 
wood in the secondary growth stage of the mature plant will be characterized by the presence 
of syringyl lignin. 

[0059] Transgenic embryogenic cells are allowed to replicate and develop into a somatic 
embryo, which are then converted into a somatic seedling. 

VIII. Identification, Production and Insertion of a GL mRNA Anti-Sense Sequence 
[0060] In addition to adding ASL DNA sequences, anti-sense sequences may be 
incorporated into a gymnosperm genome, via GSL expression cassettes, in order to suppress 
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formation of the less preferred native gymnosperm lignin. To this end, the gymnosperm 
lignin gene is first located and sequenced in order to determine its nucleotide sequence. 
Methods for locating and sequencing amino acids which have been previously discussed may 
be employed. For example, if the gymnosperm lignin gene has already been purified, 
standard sequencing methods may be employed to determine the DNA nucleic acid sequence. 
[0061] If the gymnosperm lignin gene has not been purified and functionally similar DNA 
or mRNA sequences from similar species are known, those sequences may be compared to 
identify highly conserved regions and this information used as a basis for the construction of 
a probe. A gymnosperm cDNA or genomic library can be probed with the above mentioned 
sequences to locate the gymnosperm lignin cDNA or genomic DNA. Once the gymnosperm 
lignin DNA is located, it may be sequenced using standard sequencing methods. 
[0062] After the DNA sequence has been obtained for a gymnosperm lignin sequence, the 
complementary anti-sense strand is constructed and incorporated into an expression cassette. 
For example, the GL mRNA anti-sense sequence may be fused to a promoter region to form 
an expression cassette as described above. In a preferred method, the GL mRNA anti-sense 
sequence is incorporated into the previously discussed GSL expression cassette which is 
inserted into the gymnosperm genome as described above. 

IX. Inclusion of Cytochrome P450 Reductase (CPK) to Enhance Biosynthesis Of Syringyl 
Lignin in Gymnosperms 

[0063] In the absence of external cofactors such as NADPH (an electron donor in reductive 
biosyntheses), certain angiosperm lignin genes such as the FA5H P450 genes may remain 
inactive or not acheive full or desired activity after insertion into the genome of a 
gymnosperm. Inactivity or insufficient activity can be determined by testing the resulting 
plant which contains the FA5H P450 genes for the presence of syringyl lignin in secondary 
growth. It is known that cytochrome P450 reductase (CPR) may be involved in promoting 
certain reductive biochemical reactions, and may activate the desired expression of genes in 
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many plants. Accordingly, if it is desired to enhance the expression of the angiosperm 
syringyl lignin genes in the gymnosperm, CPR may be inserted in the gymnosperm genome. 
In order to express CPR, the DNA sequence of the enzyme is ligated to a constitutive 
promoter or, for a specific species such as loblolly pine, xylem-specific lignin promoters such 
as PAL, 4CL1B or 4CL3B to form an expression cassette. The expression cassette may then 
be inserted into the gymnosperm genome by various methods as described above. 
X. Examples 

[0064] The following non-limiting examples illustrate further aspects of the invention. In 
these examples, the angiosperm is Liquidambar styraciflua (L.)[sweetgum] and the 
gymnosperm is Pinus taeda (L.)[loblolly pine]. The nomenclature for the genes r e fer e d 
referred to in the examples is as follows: :?* 



Genes 


Biochemical Name ? 


4CL (angiosperm) 


4-coumarate CoA ligase 


bi-OMT (angiosperm) 


bifunctional-O-methyl transferase 1 


FA5HP450-1 (angiosperm) 


Cvtochrome P450ferulic acid 5 hydroxylase 


FA5HP450-2 (angiosperm) 


Cytochrome P450forulic acid 5 hydroxylase 


PAL (gymnosperm) 


phenylalanine ammonia-lyase 


4CL1B (gymnosperm) 


4-coumarate CoA ligase 


4CL3B (gymnosperm) 


4-coumarate CoA ligase 



Example 1 - Isolating and Sequencing bi-OMT and 4CL Genes from an Angiosperm 
[0065] A cDNA library for Sweetgum was constructed in Lambda ZAPII, available from 
Stratagene, of LaJolla, Calif., using poly(A)+RNA isolated from Sweetgum xylem tissue. 
Probes for bi-OMT and 4CL were obtained through reverse transcription of their mRNAs and 
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followed by double PCR using gene-specific primers which were designed based on the 
OMT and CCL cDNA sequences obtained from similar genes cloned from other species. 
[0066] Three primers were used for amplifying OMT fragments. One was an oligo-dT 
primer . One was a -bi-OMT, (which was cloned through used to clone gene fragments 
through modified differential display technique, as described below in Example 2) and the 
other two were degenerate primers, which were based on the conserved sequences of all 
known OMTs. The two degenerate primers were derived based on the following amino acid 
sequences: 

5'-Gly Gly Met Ala Thr Tyr Cys Cys Ala Thr Thr Tyr Ala Ala Cys Ala Ala 
Gly Gly Cys-3' (primer #22) (SEP ID 12) and 

3'-Ala Ala Ala Gly Ala Gly Ala Gly Asn Ala Cys Asn Asn Ala Asn Asn Ala 
Asn Gly Ala-5' (primer #23) TSEO ID 13) . 

[0067] A 900 bp PCR product was produced when oligo-dT primer and primer #22 were 
used, and a 550 bp fragment was produced when primer numbers 22 and 23 were used. 
[0068] Three primers were used for amplifying CCL fragments. They were derived from 
the following amino acid sequences: 

5'-Thr Thr Gly Gly Ala Thr Cys Cys Gly Gly He Ala Cys He Ala Cys He Gly 
Gly He Tyr Thr He Cys Cys He Ala Ala Arg Gly Gly-3 f (primer R1S) (SEP ID 14) 

5'-Thr Thr Gly Gly Ala Thr Cys Cys Gly Thr He Gly Thr He Gly Cys He Cys 
Ala Arg Cys Ala Arg Gly Thr He Gly Ala Tyr Gly Gly-3' (primer HIS) (SEP ID 15) and 

3'-Cys Cys He Cys Thr Tyr Thr Ala Asp Ala Cys Arg Thr Ala Asp Gly Cys He 
Cys Cys Ala Gly Cys Thr Gly Thr Ala-5' (primer R2A) fSEO ID 16) 

[0069] Rl S and HIS were both sense primers. Primer R2A was an anti-sense primer. A 650 
bp fragment was produced if Rl S and R2A primers were used and a 550 bp fragment was 

16 

WASH_1 634660.1 



produced when primers HIS and R2A were used. The sequence of these three primers were 
derived from conserved sequences for plant CCLs. 

[0070] The reverse transcription-double PCR cloning technique used for these examples 
consisted of adding 10 jjtgj±ffl-ef DNA-free total RNA in 25 jxljbtg-DEPC-treated water to a 
microfuge tube. Next, the following solutions were added: 

a. 5x Reverse transcript buffer 8.0 jutl, 

b. 0.1 iiM DDT MDTT 4.0 \il 

c. 10mMdNTP2.0 jil 

d. 100 |iM oligo-dT primers 8.0 [il 

e. Rnasin 2.0 \il 

f. Superscript II 1.0 jal 

[0071] After mixing, the tube was incubated at a temperature of 42° C. for one (1) hour, 
followed by incubation at 70° C. for fifteen (15) minutes. Forty (40) jal of IN NaOH was 
added and the tube was further incubated at 68° C. for twenty (20) minutes. After the 
incubation periods, 80 |il of IN HC1 was added to the reaction mixture. At the same time, 17 
|il NaOAc, 5 [i\ glycogen and 768 |il of 100% ethanol were added and the reaction mixture 
was maintained at -80° C. for 15 minutes in order to precipitate the cDNA. The precipitated 
cDNA was centrifuged at high speed at 4° C. for 15 minutes. The resulting pellet was washed 
with 70% ethanol and then dried at room temperature, and then was dissolved in 20 jal of 
water. 

[0072] The foregoing procedure produced purified cDNA which was used as a template to 
carry out first round PCR using primers #22 and oligo-dT for cloning OMT cDNA and 
primer R1S and R2A for cloning 4CL cDNA. For the first round PCR, a master mix of 50 (il 
for each reaction was prepared. Each 50 jxl mixture contained: 
a. lOx buffer 5 j^l 
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b. 25 mM MgCl 2 5 \il 

c. 100 jiM sense primer 1 pi (primer #22 for OMT and primer Rl S for CCL). 

d. 100 jil anti-sense primer 1 jxl (oligo-dT primer for OMT and R2A for CCL). 

e. lOmMdNTP 1 |xl 

f. Taq. DNA polymerase 0.5 jal 

[0073] Of this master mix, 48 pi was added into a PCR tube containing 2 [il of cDNA for 
PCR. The tube was heated to 95° C. for 45 seconds, 52° C. for one minute and 72° C. for two 
minutes. This temperature cycle was repeated for 40 cycles and the mixture was then held at 
72° C. for 10 minutes. 

[0074] The cDNA fragments obtained from the first round of PCR were used as templates 
to perform the second round of PCR using primers 22 and 23 for cloning bi-OMT cDNA and 
primer HIS and R2A for cloning 4CL cDNA. The second round of PCR conditions were the 
same as the first round. 

[0075] The desired cDNA fragment was then sub clon e d subcloned and sequenced. After 
the second round of PCR, the product with the predicted size was excised from the gel and 
ligated into a pUC19 vector, available from Clonetech, of Palo Alto, Calif, and then 
transformed into DH5. alpha., an E. coli strain, available from Gibco BRL, of Gaithersburg, 
Md. After the inserts had been checked for correct size, the colonies were isolated and 
plasmids were sequenced using a Sequenase kit available from USB, of Cleveland, Ohio. The 
sequences are shown in FIG. 2 (SEQ ID 3 5 and 6) and FIG. 3 fSEQ4& 4 SEP ID 7 and 8) . 
Example 2 - Alternative Isolation Method of Angiosperm bi-OMT Gene 
[0076] As previously mentioned, one bi-OMT clone was produced via modified differential 
display technique. This method is another type of reverse transcription-PCR, in which DNA- 
free total RNA was reverse transcribed using oligo-dT primers with a single base pair anchor 
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to form cDNA. The oligo-dT primers used for reverse transcription of mRNA to synthesize 
cDNA were: 



[0077] These cDNAs were then used as templates for radioactive PCR which was 
conducted in the presence of the same oligo-dT primers as listed above, a bi-OMT gene- 
specific primer and 35S-dATP. The OMT gene-specific primer was derived from the 
following amino acid sequence: 

5'-Cys Cys Asn Gly Gly Asn Gly Gly Ser Ala Arg Gly Ala-3\ (SE ID 20) 
[0078] The following PCR reaction solutions were combined in a microfuge tube: 



a. H 2 0 9.2 |il, 

b. Taq Buffer 2.0 |il 

c. dNTP (25 nM)1.6 ^1 

d. Primers (5 jaM) 2 |il, for each primer 

e. 35 S-dATP 1 ^il 

f. Taq. pol. 0.2 |il 

g. cDNA 2.0 ixl 



[0079] The tube was heated to a temperature of 94° C. and held for 45 seconds, then at 37° 
C. for 2 minutes and then 72° C. for 45 seconds for forty cycles, followed by a final reaction 
at 72° C. for 5 minutes. 

[0080] The amplified products were fractionated on a denaturing polyacrylamide 
sequencing gel and autoradiography was used to identify and excise the fragments with a 
predicted size. The designed OMT gene-specific primer had a sequence conserved in a region 



4 Tl 1 A: TTTTTTTTTTTTTTA, (SEP ID 17) 



Tl 1C: TTTTTTTTTTTTTTC, (SEP ID 18) and 
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toward the 3'-end of the OMT cDNA sequence. This primer, together with oligo-dT, was 
amplified into a OMT cDNA fragment of about 300 bp. 

[0081] Three oligo-dTs with a single base pair of A, C or G, respectively, were used to pair 
with the OMT gene-specific primer. Eight potential OMT cDNA fragments with predicted 
sizes of about 300 bp were excised from the gels after several independent PCR rounds using 
different combinations of oligo-dT and OMT gene-specific oligo-nucleotides as primers. 
[0082] The OMT cDNA fragments were then re-amplified. A Southern blot analysis was 
performed for the resulting cDNAs using a 360 base-pair, 32 P radio-isotope labeled, aspen 
OMT cDNA 3'-end fragment as a probe to identify the cDNA fragments having a strong 
hybridization signal, under low stringency conditions. Eight fragments were identified. Out of 
these eight cDNA fragments, three were selected based on their high hybridization signal for 
sub-cloning and sequencing. One clone, LsOMT3'-l, (where the "Ls" prefix indicates that the 
clone was derived from the Liquidambar styraciflua (L.) genome) was confirmed to encode 
bi-OMT based on its high homology to other lignin-specific plant OMTs at both nucleotide 
and amino acid sequence levels. 

[0083] A cDNA library was constructed in Lambda ZAP II, available from Stratagene, of 
LaJolla, Calif, using 5 mg poly(A)+RNA isolated from sweetgum xylem tissue. The primary 
library consisting of approximately 0.7x1 0 6 independent recombinants was amplified and 
approximately 10 5 plaque-forming-units (pfu) were screened using a homologous 550 base- 
pair probe. The hybridized filter was washed at high stringency (0.25xSSC, 0.1% SDS, 65° 
C.) conditions. The colony containing the bi-OMT fragment identified by the probe was 
eluted and the bi-OMT fragment was produced. The sequence as illustrated in FIG. 2 (SEQ 
ID and 6 ) was obtained. 

Example 3 - Isolating and Producing the DNA which Codes for the Angiosperm FA5HP450- 
1 Gene 
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[0084] In order to find putative FA5H P450 cDNA fragments as probes for cDNA library 
screening, a highly degenerated sense primer based on the amino acid sequence of 5'-Glu, 
Glu, Phe, Arg, Pro, Glu, Arg-3' was designed based on the conserved regions found in some 
plant P450 proteins. This conserved domain was located upstream of another highly 
conserved region in P450 proteins, which had an amino acid sequence of 5-Phe Gly Xaa Gly 
Xaa Xaa Cys Xaa Glv-3' (SEP ID 21) . This primer was synthesized with the incorporation of 
an Xbol restriction site to give a 26-base-pair oligomer with a nucleotide sequence of 5 r ATG 
TGC AGT TTT TTT TTT TTT TIT TT-3' TSEO ID 22V 

[0085] This primer and the oligo-dT-XhoI primer were then used to perform PCR reactions 
with the sweetgum cDNA library as a template. The cDNA library was constructed in 
Lambda ZAPII, available from Stratagene, of LaJolla, Calif., using poly(a)+RNA isolated 
from Sweetgum xylem tissue. Amplified fragments of 300 to 600 bp were obtained. Because 
the designed primer was located upstream of the highly conserved P450 domain, this design 
distinguished whether the PCR products were P4 50 gene fragments depending on whether 
they contained the highly conserved amino acid domain. 

[0086] All the fragments obtained from the PCR reaction were then cloned into a pUC19 
vector, available from Stratagene, of LaJolla, Calif, and transformed into a DHS.alpha. E. 
coli strain, available from Gibco BRL, of Gaithersburg, Md. 

[0087] Twenty-four positive colonies were obtained and sequenced. Sequence analysis 
indicated four groupings withing within the twenty- four colonies. One was Oil 1 C4H , one 
was an unknown P450 gene, and two did not belong to P450 genes. Homologies of P450 
genes in different species are usually more than 80%. Because the homologies between the 
P450 gene families found here were around 40%, the sequence analysis indicated that a new 
P450 gene family was sequenced. Moreover, since this P450 cDNA was isolated from xylem 
tissue, it was highly probable that this P450 gene was FA5H P450 -1. 
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[0088] The novel sweetgum P450 cDNA fragment was used as a probe to screen a full 
length cDNA encoding for FA5H P450 -1 . Once the FA5H P450 -1 gene was located it was 
sequenced. The length of the FA5H P450 -1 cDNA is 1707 bp and it contains 45 bp of 5' non- 
coding region and 135 bp of 3' non-coding region. The deduced amino acid sequence also 
indicates that this P450 cDNA has a hydrophobic core at the N-terminal, which could be 
regarded as a leader sequence for c-translational targeting to membranes during protein 
synthesis. At the C-terminal region, there is a heme binding domain that is characteristic of 
all P450 genes. The FA5H P450 -1 sequence, as illustrated in FIG. 4 (SEQ ID 1 and 2 ), was 
produced, according to the above described methods. 

Example 4 - Isolating and Producing the DNA which Codes for the Angiosperm FA5HP450- 
2 Gene 

[0089] By using similar strategy of synthesizing PCR primers from the published literature 
for hydroxylase genes in plants, another full length FA5H P450 cDNA has been isolated that 
shows significant similarity with a putitive F5H clone from Arabidopsis (Meyers et al. 1996: 
PNAS 93, 6869-6874). This cloned cDNA, designated FA5H P450 -2, contains 1883 bp and 
encodes an open reading frame of 51 1 amino acids. The amino acid similarity shared between 
Arabidopsis P5H -FSH and the FA5H P450 -2 sweetgum clone is about 75% , indicating that 
th e isolat e d clone belongs to th e same class of cDNAs that encode a F5H protein, which has 
boon shown to b e functional by genetic complim e nlation in Arabidopsis . 
[0090] To confirm the function of the FA5H P450 -2 gene, it was expressed in E.coli, strain, 
DH5 alpha, via pQE vector preparation, according to directions available with the kit. A CO-- 
Fe2+binding assay was also performed to confirm the expression of FA5H P450 -2 as a 
functional P450 gene. (Omura & Sato 1964, J. of Biochemistry 239: 2370-2378, Babriac 
et.al. 1991 Archives of Biochemistry and Biophysics 288:302-309). The CO~Fe2+ binding 
assay showed a peak at 450 nm which indicates that FA5H P450 -2 has been overexpressed as 
a functional P450 gene. 
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[0091] The FA5H P450 -2 protein was further purified for production of antibodies in 
rabbits, and antibodies have been successfully produced. In addition, Western blots show that 
this antibody is specific to the membrane fraction of sweetgum and aspen xylem extract. 
When the FA5H P450 -2 antibody was added to a reaction mixture containing aspen xylem 
tissue, enzyme inhibition studies showed that the activity of FA5H P450 in aspen was reduced 
more than 60%, a further indication that FASH 2 performs a P450 lik e function. FIG. 5 (SEQ 
ID 2) illustrat e s th e FASH 2 sequ e nc e . P450-2 performs a p4501ike function. Recombinant 
P450-2 protein co-expressed with Arabidopsis CPR protein in a baculovirus expression 
system hvdroxvlated ferulic acid f specific activity: 7.3 pKat/mg protein), cinnaminic acid 
(specific activity: 25 pKat/mg protein, and p-coumeric acid (specific activity 3.8 pKat/ng 
protein). The P450-2 enzyme which may be referred to as C4C3F5-H appears to be a broad 
spectrum hydroxylase in the phenyproponoid pathway in plants FIG. 5 (SEQ ID 3 and 4) 
illustrates the P450-2 sequence. 

EXAMPLE 5 - Identifying Gvmnosperm Promoter Regions 

[0092] In order to identify gymnosperm promoter regions, sequences from loblolly pine 
PAL and 4 CL1B CL1B and 4CL3B lignin genes were used as primers to screen the loblolly 
pine genomic library, using the Genome Walker Kit. The loblolly pine PAL primer sequence 
was obtained from the GenBank, reference number U39792. The loblolly pine 4CL1B primer 
sequences were also obtained from the gene bank, reference numbers U39404 and U39405. 
[0093] The loblolly pine genomic library was constructed in Lambda Dashll, available 
from Stratagene, of LaJolla, Calif. 3x1 0 6 phage plaques from the genomic library of loblolly 
pine were screened using both the above mentioned PAL cDNA and 4CL (PCR clone) 
fragments as probes. Five 4CL clones were obtained after screening. Lambda DNAs of two 
4CL of the five 4CL clones obtained after screening were isolated and digested by EcoRV, 
PstfPstd, Sa frSall and Xbal for Southern analysis. Southern analysis using 4CL fragments as 
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probes indicated that both clones for the 4CL gene were identical. Results from further 

mapping showed that none of the original five 4CL clones contained promoter regions. When 

tested, the PAL clones obtained from the screening also did not contain promoter regions. 

[0094] In a second attempt to clone the promoter regions associated with the PAL and 4CL 

a Universal Genome Walker.TM. kit, available from CLONETECH, was used. In the process, 

total DNA from loblolly pine was digested by several restriction enzymes and ligated into the 

adaptors (libraries) provi e d provided with the kit. Two gene-specific primers for each gene 

were designed (GSP1 and 2). After two rounds of PCR using these primers and adapter 

primers of the kit, several fragments were amplified from each library. A 1.6 kb fragment and 

a 0.6 kb fragment for PAL gene and a 2.3 kb fragment (4CL1B) and a 0.7 kb fragment 

(4CL3B) for the 4CL gene were cloned, sequenced and found to contain promoter regions for 

all three genes. See FIG. 6 (SEQ ID 610), 7 (SEQ ID 711) and 8 (SEQ ID $9). 

Example 6 - Fusing the ASL DNA Sequence to A Constitutive Promoter Region and 
Inserting the Expression Cassette Into a Gvmnosperm Genome 

[0095] As a first step, a ASL DNA sequence, FA5H P450 -L was fused with a constitutive 
promoter region according to the methods described in the above Section IV to form an 
FA5H P450 -1 expression cassette. A second ASL DNA sequence, FA5H P450 -2, was then 
fused with a constitutive promoter in the same manner to form an FA5H P450 -2 expression 
cassette. The FA5H P450 -1 expression cassette was inserted into the gymnosperm genome by 
micro-projectile bombardment. Embryogenic tissue cultures of loblolly pine were initiated 
from immature zygotic embryos. The tissue was maintained in an undifferentiated state on 
semi-solid proliferation medium, according to methods described by Newton et al. TAES 
Technical Publication "Somatic Embryogenesis in Slash Pine", 1995 and Keinonen-Mettala 
et al. 1996, Scand. J. For. Res. 1 1: 242-250. 

[0096] After separation, 5 ml of the liquid cell suspension fraction which passes through the 
40 mesh screen was vacuum deposited onto filter paper and placed on semi-solid 
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proliferation medium. The prepared gymnosperm target cells were then grown for 2 days on 
filter paper discs placed on semi-solid proliferation medium in a petri dish. These target cell 
were then bombarded with plasmid DNA containing the FA5H P450 -1 expression cassette 
and an expression cassette containing a selectable marker gene encoding the enzyme which 
confers resistance to the antibiotic hygromycin B. A 1:1 mixture of of selectable marker 
expression cassette and plasmid DNA containing the FA5H P450 -1 expression cassette is 
precipitated with gold (1.5-3.0 microns) as described by Sanford et al. (1992). The DNA- 
coated microprojectiles were rinsed in absolute ethanol and aliqots aliquots of 10 \il (5 jig 
DNA/3 mg gold) were dried onto a macrocarrier, such as those available from BioRad 
(Hercules, Calif.). 

[0097] Prior to bombardment, embryogenic tissue was desiccated under a sterile laminar- 
flow hood for 5 minutes. The desiccated tissue was transferred to semi-solid proliferation 
medium. The microprojectiles were accelerated into desiccated target cells using a BioRad 
PDS-1000/HE particle gun. 

[0098] Each plate was bombarded once, rotated 180 degrees, and bombarded a second time. 
Preferred bombardment parameters were 1350 psi rupture disc pressure, 6 mm distance from 
the rupture disc to macrocarrier (gap distance), 1 cm macrocarrier travel distance, and 10 cm 
distance from macrocarrier stopping screen to culture plate (microcarrier travel distance). 
Tissue was then transferred to semi-solid proliferation medium containing hygromycin B for 
two days after bombardment. 

[0099] The FA5H P450 -2 expression cassette was inserted into the gymnosperm genome 

according to the same procedures. 

Example 7 - Selecting Transformed Target Cells 

[0100] After insertion of the FA5H P450 -1 expression cassette and the selectable marker 
expression cassette into the gymnosperm target cells as described in Example 6, transformed 
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cells were selected by exposure to an antibiotic that causes mortality of any cells not 
containing the GSL expression cassette. Forty independent cell lines were established from 
cultures cobombarded with an expression cassette containing a hygromycin resistance gene 
construct and the FA5H P450 -1 construct. These cell lines include lines Y2, Y17, Y7 and 04, 
as discussed in more detail below. 

[0101] PCR techniques were then used to verify that the FA5H P450 -1 gene had been 
successfully integrated into the genomes of ef-the established cell lines by extracting genomic 
DNA using the Plant DNAeasy kit, available from QuatgnQuaigen. 200 ng DNA from each 
cell line were used for each PCR reaction. Two FA5H P450 -1 specific primers were designed 
to perform a PCR reaction with a 600 bp PCR product size. The primers were: 

LsFft5H P450 -iml-S primer: ATGGCTTTCCTTCTAATACCCATCTC_(SEQ 

ID 23} , and 

LsFA5 HP450 -iml-A primer: GGGTGTAATGGACGAGCAAGGACTTG 

(SEO ID 24) . 

[0102] Each PCR reaction (100 |al) consisted of 75 \d H20, 1 jxl MgCl (25 mM), 10 jal PCR 
buffer 1 ja.1 10 mM dNTPs, and 10 |xl DNA. 100 [i\ oil was layered on the top of each reaction 
mix. Hot start PCR was done as follows: PCR reaction was incubated at 95 degrees C. for 7 
minutes and 1 jal each of both Ls FA5H P450 -iml-S and LsFa5 MP450 -iml-A primers (100 
(iM stock) and 1 |il of Taq polymerase were added through oil in each reaction. The PCR 
program used was 95 degrees C. for 1.5 minutes, 55 degrees C. for 45 sec and 72 degrees C. 
for 2 minutes, repeated for 40 cycles, followed by extension at 72 degrees C. for 10 minutes. 
[0103] The above PCR products were employed to determine if gymnosperm cells 
contained the angiosperm lignin gene sequences. With reference to FIG. 9, PCR 
amplification was performed using template DNA from cells which grew vigorously on 
hygromycin B-containing medium. The PCR products were el e ctrophor e sis electrophoresed 
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in an agarose gel containing 9 lanes. Lanes 14 contained PCR amplification of products of the 
Sweetgum FA5H P450 -1 gene from a non-transformed control and transgenic loblolly pine 
cell lines. Lane 1 contained the non-transformed control PT52. Lane 2 contained transgenic 
line Y2. Lane 3 contained transgenic line Y17 and Lane 4 contained the plasmid which 
contains the expression cassette pS SLsFA5 HP450 1 -im-s. Lanes 2 through 4 all contain an 
amplified fragment of about 600 bp, indicating that the FA5H P450 -1 gene has been 
successfully inserted into transgenic cell lines Y2 and Y17. 

[0104] Lane 5 contained a DNA size marker Phi 1 74/Haeffl- HaeII (BRL). The top four 
bands in this lane indicate molecular sizes of 1353, 1078, 872 and 603 bp. 
[0105] Lanes 6-9 contained PCR amplification products of hygromycin B gene from non- 
transformed control and transgenic loblolly pine cell lines. Lane 6 contained the non- 
transformed control PT52 lin e , availabl e from lane referenced to as PTS . Lane 7 

contained transgenic line Y7. Lane 8 contained transgenic line 04. Lane 9 contained the 
plasmid which includes the expression cassette containing the gene encoding the enzyme 
which confers resistance tot he antibiotic hygromycin B. Lanes 7-9 all show an amplified 
fragment of about 1000 bp, indicating that the hygromycin gene has been successfully 
inserted into transgenic lines Y7 and 04. 

[0106] These PCR results confirmed the presence of FA5H P450 -1 and hygromycin 
resistance gene in transformed loblolly pine cell cultures. The results obtained from the PCR 
verification of 4 cell lines, and similar tests with the remaining 36 cell lines, confirm stable 
integration of the FA5H P450 -1 gene and the hygromycin B gene in 25% of the 40 cell lines. 
[0107] In addition, loblolly pine embryogenic cells which have been co-bombarded with 
the FA5H P450 -2 and hygromycin B expression cassettes, are growing vigorously on 
hygromycin selection medium, indicating that the FA5H P450 -2 expression cassette was 
successfully integrated into the gymnosperm genome. 
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[0108] Although various embodiments and features of the invention have been described in 
the foregoing detailed description, those of ordinary skill will recognize the invention is 
capable of numerous modifications, rearrangements and substitutions without departing from 
the scope of the invention as set forth in the appended claims. For example, in the case where 
the lignin DNA sequence is transcribed and translated to produce a functional syringyl lignin 
gene, those of ordinary skill will recognize that because of codon degeneracy a number of 
polynucleotide sequences will encode the same gene. These variants are intended to be 
covered by the DNA sequences disclosed and claimed herein. In addition, the sequences 
claimed herein include those sequences with encode a gene having substantial functional 
identity with those claimed. Thus, in the case of syringyl lignin genes, for example, the DNA 
sequences include variant polynucleotide sequences encoding polypeptides which have 
substantial identity with the amino acid sequence of syringyl lignin and which show syringyl 
lignin activity in gymnosperms. 
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AAACCCTAAA ACTCACCTCT CTTACCCTTT CTCTTCA ATG GCT TTC 

Met Ala Phe 



CTT CTA ATA 
Leu Leu lie 



ATC TCA ATA ATC TTC ATC GTC TTA GCT TAC CAG 
lie Ser He He Phe He Val Leu Ala Tyr Gin 




CTC TAT CAA CGG (STC AGA TTT AAG CTC CCA CCC GGC CCA CGT CCA 
Leu Tyr Gin Arg Le>i Arg Phe Lys Leu Pro Pro Gly Pro Arg Pro 



TGG CCG ATC GTC GGA 
Trp Pro He Val Gly 



CGG TGT TTC GCC GAG TGG 
Arg Cys Phe Ala Glu Trp 



CTT TAC GAC ATA AAA CCG GTG AGG TTC 
Leu Tyr Asp He Lys Pro Val Arg Phe 



CAA GCG TAC GGT CCG ATC ATA TCG 
Gin Ala Tyr Gly Pro He He Ser 



GTG TGG TTC GGT TCA ACG TTG fcAT GTG ATC GTA TCG AAT TCG GAA 
Val Trp Phe Gly Ser Thr Leu Asto Val He Val Ser Asn Ser Glu 

TTG GCT AAG GAA GTG CTC AAG GAA AAA GAT CAA CAA TTG GCT GAT 
Leu Ala Lys Glu Val Leu Lys Glu Lys^ Asp Gin Gin Leu Ala Asp 

AGG CAT AGG AGT AGA TCA GCT GCC AAA TTT AGC AGG GAT GGG CAG 
Arg His Arg Ser Arg Ser Ala Ala Lys Ptte Ser Arg Asp Gly Gin 



GAC CTT ATA TGG GCT GAT TAT GGA CCT CAC 
Asp Leu He Trp Ala Asp Tyr Gly Pro His 

AAG GTT TGT ACC CTC GAG CTT TTT ACT CCA AAG 
Lys Val Cys Thr Leu Glu Leu Phe Thr Pro Lys 



GTG AAG GTT ACA 
Val Lys Val Thr 

CTT GAA GCT 
Leu Glu Ala 



CTT AGA CCC ATT AGA GAA GAT GAA GTT ACA GCC ATG^GTT GAG TCC • 
Leu Arg Pro He Arg Glu Asp Glu Val Thr Ala Met Val Glu Ser 

ATT TTT AAT GAC ACT GCG AAT CCT GAA AAT TAT GGG AAG\AGT ATG 
He Phe Asn Asp Thr Ala Asn Pro Glu Asn Tyr Gly Lys Ver Met 

CTG GTG AAG AAG TAT TTG GGA GGA GTA GCA TTC AAC AAC ATf\ACA 
Leu Val Lys Lys Tyr Leu Gly Ala Val Ala Phe Asn Asn He Vhr 

AGA CTC GCA TTT GGA AAG CGA TTC GTG AAT TCA GAG GGT GTA ATG 
Arg Leu Ala Phe Gly Lys Arg Phe Val Asn Ser Glu Gly Val Met 1 

GAC GAG CAA GGA CTT GAA TTT AAG GAA ATT GTG GCC AAT GGA CTC 
Asp Glu Gin Gly Leu Glu Phe Lys Glu He Val Ala Asn Gly Leu 

AAG CTT GGT GCC TCA CTT GCA ATG GCT GAG CAC ATT CCT TGG CTC 
Lys Leu Gly Ala Ser Leu Ala Met Ala Glu His He Pro Trp Leu 

CGT TGG ATG TTC CCA CTT GAG GAA GGG GCC TTT GCC AAG CAT GGG 
Arg Trp Met Phe Pro Leu Glu Glu Gly Ala Phe Ala Lys His Gly 

GCA CGT AGG GAC CGA CTT ACC AGA GCT ATC ATG GAA GAG CAC ACA 
Ala Arg Arg Asp Arg Leu Thr Arg Ala He Met Glu Glu His Thr 
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A1%. GCC CGT AAA AAG AGT GGT GGA GCC CAA CAA CAT TTC GTG GAT 
Ma Arg Lys Lys Ser Gly Gly Ala Gin Gin His Phe Val Asp 



GCA TTGXCTC ACC CTA CAA GAG AAA TAT GAC CTT AGC GAG GAC ACT 
Ala Leu \eu Thr Leu Gin Glu Lys Tyr Asp Leu Ser Glu Asp Thr 



ATT ATT GG< 
He He Gly 



CTT TGG GAT ATG ATC ACT GCA GGC ATG GAC ACA 
su Leu Trp Asp Met He Thr Ala Gly Met Asp Thr 



ACC GCA ATC TCl\GTC GAA TGG GCC ATG GCC GAG TTA ATT AAG AAC 
Thr Ala He Ser \al Glu Trp Ala Met Ala Glu Leu He Lys Asn 

CCA AGG GTG CAA CAA\AAA GCT CAA GAG GAG CTA GAC AAT GTA CTT 
Pro Arg Val Gin Gin ays Ala Gin Glu Glu Leu Asp Asn Val Leu 



GGG TCC GAA CGT GTC CTG 
Gly Ser Glu Arg Val Leu 



GAA TTG GAC TTC TCA AGC CTC CCT 
ar Glu Leu Asp Phe Ser Ser Leu Pro 



TAT CTA CAA TGT GTA GCC AAG 
Tyr Leu Gin Cys Val Ala Lys 



\G GCA CTA AGG CTG CAC CCT CCA 
Ala Leu Arg Leu His Pro Pro 



ACA CCA CTA ATG CTC CCT CAT CGC \3CC AAT GCC AAC GTC AAA ATT 
Thr Pro Leu Met Leu Pro His Arg Ala Asn Ala Asn Val Lys He 



GGT GGC TAC GAC ATC CCT AAG GGA TCA 
Gly Gly Tyr Asp He Pro Lys Gly Ser 



TGG GCC GTG GCT CGT GAT CCA GCA GTG TGG 
Trp Ala Val Ala Arg Asp Pro Ala Val Trp 



GTT CAT GTA AAT GTC 
Val His Val Asn Val 



GAC CCA CTA GAG 
Asp Pro Leu Glu 



TTT CGA CCG GAA CGG TTC TCT GAA GAC GAT GTC 
Phe Arg Pro Glu Arg Phe Leu Glu Asp Asp Val 



>C ATG AAA GGT 
Met Lys Gly 



CAC GAT TAT AGG CTA CTG CCG TTT GGT GCA GGG AGG 
His Asp Tyr Arg Leu Leu Pro Phe Gly Ala Gly Arg 



IT GTT TGC 
val Cys 



CCC GGT GCA CAA CTT GGC ATC AAT TTG GTC ACA TCC ATG ATG GGT 
Pro Gly Ala Gin Leu Gly He Asn Leu Val Thr Ser Met Me^ Gly 

CAC CTA TTG CAC CAT TTC TAT TGG AGC CCT CCT AAA GGT GTA 
His Leu Leu His His Phe Tyr Trp Ser Pro Pro Lys Gly Val Ly 

CCA GAG GAG ATT GAC ATG TCA GAG AAT CCA GGA TTG GTC ACC TAC 
Pro Glu Glu He Asp Met Ser Glu Asn Pro Gly Leu Val Thr Tyr 

ATG CGA ACC CCG GTG CAA GCT GTT CCC ACT CCA AGG CTG CCT GCT 
Met Arg Thr Pro Val Gin Ala Val Pro Thr Pro Arg Leu Pro Ala 

CAC TTG TAC AAA CGT GTA GCT GTG GAT ATG TAATTCTTAG TTTGTTATTA 
His Leu Tyr Lys Arg Val Ala Val Asp Met 

TTCATGCTCT TAAGGTTTTG GACTTTGAAC TTATGATGAG ATTTGTAAAA TTCCAAGTGA 
TCAAATGAAG AAAAGACCAA ATAAAAAGGC TTGACGATTT AAAAAAAAAA AAAAAAA 



1001 



1046 



1091 



1271 




1361 



14 06 



1451 



1496 



1591 



^formation for Sequence ID #: 
Correspondence Address: 



Addressee: 
Streei 
City: 
Country:' 
Zip: 

Computer Readable Form: 
Medium ^Type : 
Operating^ System: 
Software : 

Current Application Data 
Attorney Information 
Name 

Registration Number: 
Reference/Docket\Number 
Te lecommuni ca t ion\l n f o rma t i on 
Telephone 
Telefax: 
Information for Seq ID No: 
Sequence Characteristics : 
Length : 
Type: 

Strandedness : 
Topology : 
Molecule Type: 
Hypothetical : 
Anti-Sense: 
Fragment Type : 
Original Source: 
Organism: 
Strain : 

Individual Isolate: 
Developmental State: 
Hap lo type : 
Tissue Type: 
Cell Type: 
Cell Line: 
Organelle: 



2 (FA5H-2) 

Luedeka, Neely & Graham 

P.O. Box 1871 

Knoxville, TN 37901 

Mark S. Graham 

P.O. Box 1871 

Knoxville 

USA 

37901 

1.44 
DOS 
ASCII 
N/A 

Mark S. Graham 

32,355 

50617.00 

(423) 546-4305 
(423) 523-4478 



1883 

DNA 

double 

linear 

cDNA 

tfo 

\ 

Liquidambar styraciflua (L.) 
Wild Wpe 
N/A V 
sporophy^ 
N/A 
xylem 
parenchyma 
N/A 
N/A 



32 



TGCAAACCTG CACAAACAAA GAGAGAGAAG AAGAAAAAGG 
AW?AGAGGAG AGAGAGAGAG AGAGAGAGAA GC 

CAT^GA TTC TTC TCT TCA TGA AGC CTT GCA ACC ACT ACC CAT GAC GCT 
Met Asp Ser Ser Leu His Glu Ala Leu Gin Pro Leu Pro Met Thr Leu 



.40 
72 
120 



GTT CTT \AT TAT ACC TTT GCT ACT CTT ATT GGG CCT AGT ATC TCG GCT 
Phe Phe lie lie Pro Leu Leu Leu Leu Leu Gly Leu Val Ser Arg Leu 



168 



TCG CCA GAG ACTVACC ATA CCC ACC AGG CCC AAA AGG CTT ACC GGT GAT 
Arg Gin Arg Leu\ro Tyr Pro Pro Gly Pro Lys Gly Leu Pro Val He 



216 



CGG AAA CAT GCT CAT GAT GGA TCA ACT CAC TCA CCG AGG ACT CGC CAA 
Gly Asn Met Leu Met Me\Asp Gin Leu Thr His Arg Gly Leu Ala Lys 



264 



ACT CGC CAA ACA ATA CGG CGG 
Leu Ala Lys Gin Tyr Gly Gly 



ATT CCA CCT CAA GAT GGG ATT CTT 
Phe His Leu Lys Met Gly Phe Leu 



312 



ACA CAT GGT GGC CGT TTC CAC ACC C(?A CAT GGC TCG CCA AGT CCT TCA 
His Met Val Ala Val Ser Thr Pro Asp\Met Ala Arg Gin Val Leu Gin 



CC\? 



AGT CCA AGA CAA CAT CTT CTC GAA CCG GCC\AGC CAC CAT AGC CAT CAG 
Val Gin Asp Asn He Phe Ser Asn Arg Pro Ala Thr He Ala He Ser 



360 



408 



CTA CCT CAC CTA TGA CCG AGC CGA CAT GGC CTT GGC TCA CTA CGG CCC 
Tyr Leu Thr Tyr Asp Arg Ala Asp Met Ala Phe Ala His Tyr Gly Pro 



GTT TTG GCG TCA GAT GCG TAA ACT CTG CGT CAT GAA ATT ATT TAG CCG 
Phe Trp Arg Gin Met Arg Lys Leu Cys Val Met Lys Leu\Phe Ser Arg 



GAA ACG AGC CGA GTC GTG GGA GTC GGT CCG AGA CGA GGT CGA 
Lys Arg Ala Glu Ser Trp Glu Ser Val Arg Asp Glu Val Asp 



456 



AGT ACG AGT GGT CGC GTC CAA TAT TGG GTC GAC GGT GAA TAT CGG CG 
Val Arg Val Val Ala Ser Asn He Gly Ser Thr Val Asn He Gly Glu 



GCT GGT TTT TGC TCT GAC GAA GAA TAT TAC TTA CAG GGC GGC TTT TGG 
Leu Val Phe Ala Leu Thr Lys Asn lie Thr Tyr Arg Ala Ala Phe Gly 



GAC GAT CTC GCA TGA GGA CCA GGA CGA GTT CGT GGC CAT ACT GCA AGA 
Thr He Ser His Glu Asp Gin Asp Glu Phe Val Ala He Leu Gin Glu 

GTT TTC GCA GCT GTT TGG TGC TTT TAA TAT AGC TGA TTT TAT CCC TTG 




744 



33 



PheNser Gin Leu Phe Gly Ala Phe Asn lie Ala Asp Phe lie Pro Trp 

GCT CAA ATG GGT TCC TCA GGG GAT TAA CGT CAG GCT CAA CAA GGC ACG 
Leu Lys\Trp Val Pro Gin Gly He Asn Val Arg Leu Asn Lys Ala Arg 



792 



AGG GGC GOT TGA TGG GTT TAT TGA CAA GAT CAT CGA CGA TCA TAT ACA 
Gly Ala Leu\ Asp Gly Phe He Asp Lys He He Asp Asp His He Gin 



840 



GAA GGG GAG tA AAA CTC GGA GGA GGT TGA TAC TGA TAT GGT AGA TGA 
Lys Gly Ser LysNAsn Ser Glu Glu Val Asp Thr Asp Met Val Asp Asp 



888 



TTT ACT TGC TTT TTA\C GG TGA GGA AGC CAA AGT AAG CGA ATC TGA CGA 
Leu Leu Ala Phe Tyr \ly Glu Glu Ala Lys Val Ser Glu Ser Asp Asp 



936 



TCT TCA AAA TTC CAT CAA 
Leu Gin Asn Ser He Lys 



Le\ 



CAC CAA AGA CAA CAT CAA AGC TAT CAT 
Thr Lys Asp Asn He Lys Ala He Met 



984 



GGA CGT AAT GTT TGG AGG GAC CG^AAC GGT GGC GTC CGC GAT TGA ATG 
Asp Val Met Phe Gly Gly Thr Glu\hr Val Ala Ser Ala He Glu Trp 



1032 



GGC CAT GAC GGA GCT GAT GAA AAG CCC AGA AGA TCT AAA GAA GGT CCA 
Ala Met Thr Glu Leu Met Lys Ser Pro G1U Asp Leu Lys Lys Val Gin 



1080 



ACA AGA ACT CGC CGT GGT GGT GGG TCT TGA CCC 
Gin Glu Leu Ala Val Val Val Gly Leu Asp Arg 



GCG AGT CGA AGA GAA 
Val Glu Glu Lys 



1128 



AGA CTT CGA GAA GCT CAC CTA CTT GAA ATG CGT ACTXSAA GGA AGT CCT 
Asp Phe Glu Lys Leu Thr Tyr Leu Lys Cys Val Leu LVs Glu Val Leu 



1176 



TCG CCT CCA CCC ACC CAT CCC ACT CCT CCT CCA CGA GAC TGC CGA GGA 
Arg Leu His Pro Pro He Pro Leu Leu Leu His Glu Thr Alk Glu Asp 



1224 



CGC CGA GGT CGG CGG CTA CTA CAT TCC GGC GAA ATC GCG GGT GAT, GAT 
Ala Glu Val Gly Gly Tyr Tyr He Pro Ala Lys Ser Arg Val MetN£le 



1272 



CAA CGC GTG CGC CAT CGG CCG GGA CAA GAA CTC GTG GGC CGA CCC AGA 
Asn Ala Cys Ala He Gly Arg Asp Lys Asn Ser Trp Ala Asp Pro Asp 



1320 



TAC GTT TAG GCC CTC CAG GTT TCT CAA AGA CGG TGT GCC CGA TTT CAA 
Thr Phe Arg Pro Ser Arg Phe Leu Lys Asp Gly Val Pro Asp Phe Lys 



AGG GAA CAA CTT CGA GTT CAT CCC ATT CGG GTC AGG TCG TCG GTC TTG 
Gly Asn Asn Phe Glu Phe He Pro Phe Gly Ser Gly Arg Arg Ser Cys 



1416 



34 



CCC CGG TAT GCA ACT CGG ACT CTA CGC GCT AGA GAC GAC TGT GGC TCA 

pro Giy\ Met Gln Leu G1 y Leu r y r Ala Leu Glu Thr Thr Val Ala His 



1464 



CCT CCT TCA CTG TTT CAC GTG GGA GTT GCC GGA CGG GAT GAA ACC GAG 
Leu Leu His\cys Phe Thr Trp Glu Leu Pro Asp Gly Met Lys Pro Ser 



1512 



TGA ACT CGA GAT, GAA TGA TGT GTT TGG ACT CAC CGC GCC AAG AGC GAT 
Glu Leu Glu Met\sn Asp Val Phe Gly Leu Thr Ala Pro Arg Ala lie 

TCG ACT CAC CGC CGT GCC GAG TCC ACG CCT TCT CTG TCC TCT CTA 
Arg Leu Thr Ala Val Vro Ser Pro Arg Leu Leu Cys Pro Leu Tyr 



1560 



1605 



TTGATCGAAT GATTGGGGGA GCTTTGTGGA GGGGCTTTTA TGGAGACTCT ATATATAGAT 1665 
GGGAAGTGAA ACAACGACAG GTGA^TGCTT GGATTTTTGG TATATATTGG GGAGGGAGGG 1725 



GAAAAAAAAA ATAATGAAAG GAAAGAAA^G AGAGAATTTG AATTTCTCTT CCTCTGTGGA 1785 
TAAAAGCCTC GTTTTTAATT GTTTTTATGT GGAGATATTT GTGTTTGTTT ATTTTTATCT 1845 



CTTTTTTTGC AATAACACTC AAAAATAAAA A^AAAAAA 



1883 



\ 
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Information for Sequence ID #: 
Correspondence Address : 



Addressee : 
Street : 
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Country: 
Zip : 
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Medium Type: 
Operating Systeii 
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Attorney Information: 
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Telecommunication Information: 
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Telefax: 
Information for Seq ID No: 
Sequence Characteristics: 
Length: 
Type: 

Strandedness : 
Topology : 
Molecule Type: 
Hypothetical : 
Ant i- Sense: 
Fragment Type: 
Original Source: 
Organism: 
Strain: 

Individual Isolate : 
Developmental State: 
Haplotype : 
Tissue Type: 
Cell Type: 
Cell Line: 
Organelle: 
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P.O. Box 1B71 

Knoxville, TN 37901 
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P.O. Box 1871 

Knoxville 

USA 

37901 

1.44 
DOS 
ASCII 
N/A 

Mark S. 
32,355 
50617.00 

(423) 546-4305 
(423) 523-4478 



138C 
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cDNA 
No 
No 
N/A 

Liguidambar styi 
Wild Type 
N/A 

sporophyte 
N/A 
xylem 
parenchyma 
N/A 
N/A 
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CACGAGC CCTACCTCCT TTCTTGGAAA AATTTCCCCA TTCGATCACA ATCCGGGCCT 



60 



caaaaa\^tg GGA TCA ACA AGC GAA ACG AAG ATG AGC CCG AGT GAA GCA 
*t Gly Ser Thr Ser Glu Thr Lys Met Ser Pro Ser Glu Ala 

GCA GCA GCA GAA GAA GAA GCA TTC GTA TTC GCT ATG CAA TTA ACC AGT 
Ala Ala Ala\siu Glu Glu Ala Phe Val Phe Ala Met Gin Leu Thr Ser 

GCT TCA GTT Cl\cCC ATG GTC CTA AAA TCA GCC ATA GAG CTC GAC GTC 
Ala Ser Val Leu \ro Met Val Leu Lys Ser Ala He Glu Leu Asp Val 

TTA GAA ATC ATG GCT\ AAA GCT GGT CCA GGT GCG CAC ATA TCC ACA TCT 
Leu Glu He Met Ala \ys Ala Gly Pro Gly Ala His He Ser Thr Ser 

GAC ATA GCC TCT AAG CTG\CCC ACA AAG AAT CCA GAT GCA GCC GTC ATG 
Asp He Ala Ser Lys Leu Pro Thr Lys Asn Pro Asp Ala Ala Val Met 



CTT GAC CGT ATG CTC CGC CTC N 
Leu Asp Arg Met Leu Arg Leu 



GCT AGC TAC TCT GTT CTA ACG TGC 
Ala Ser Tyr Ser Val Leu Thr Cys 



108 



156 



204 



252 



300 



348 



TCT CTC CGC ACC CTC CCT GAC GGC AAG ATC GAG AGG CTT TAC GGC CTT 396 
Ser Leu Arg Thr Leu Pro Asp Gly L/s^ He Glu Arg Leu Tyr Gly Leu 

GCA CCC GTT TGT AAA TTC TTG ACC AGA AAC GAT GAT GGA GTC TCC ATA 444 
Ala Pro Val Cys Lys Phe Leu Thr Arg Asn\Asp Asp Gly Val Ser He 

GCC GCT CTG TCT CTC ATG AAT CAA GAC AAG G*TC CTC ATG GAG AGC TGG 4 92 

Ala Ala Leu Ser Leu Met Asn Gin Asp Lys ValNLeu Met Glu Ser Trp 

TAC CAC TTG ACC GAG GCA GTT CTT GAA GGT GGA ATT CCA TTT AAC AAG 540 
Tyr His Leu Thr Glu Ala Val Leu Glu Gly Gly IleNPro Phe Asn Lys 

GCC TAT GGA ATG ACA GCA TTT GAG TAC CAT GGC ACC Ga\ CCC AGA TTC 588 
Ala Tyr Gly Met Thr Ala Phe Glu Tyr His Gly Thr Asp^ro Arg Phe 

AAC ACA GTT TTC AAC AAT GGA ATG TCC AAT CAT TCG ACC ATl\ACC ATG 636 
Asn Thr Val Phe Asn Asn Gly Met Ser Asn His Ser Thr Ile\hr Met 

AAG AAA ATC CTT GAG ACT TAC AAA GGG TTC GAG GGA CTT GGA TC*f\GTG 6 34 

Lys Lys He Leu Glu Thr Tyr Lys Gly Phe Glu Gly Leu Gly Ser yal 

GTT GAT GTT GGT GGT GGC ACT GGT GCC CAC CTT AAC ATG ATT ATC GCT\ 7 32 

Val Asp Val Gly Gly Gly Thr Gly Ala His Leu Asn Met He He Ala 

AAA TAC CCC ATG ATC AAG GGC ATT AAC TTC GAC TTG CCT CAT GTT ATT \ 7B0 
Lys Tyr Pro Met He Lys Gly He Asn Phe Asp Leu Pro His Val He 

GAG GAG GCT CCC TCC TAT CCT GGT GTG GAG CAT GTT GGT GGA GAT ATG 
Glu Glu Ala Pro Ser Tyr Pro Gly Val Glu His Val Gly Gly Asp Met 

TTT GTT AGT GTT CCA AAA GGA GAT GCC ATT TTC ATG AAG TGG ATA TGT 876 
Phe Val Ser Val Pro Lys Gly Asp Ala He Phe Met Lys Trp He Cys 



37 



CAT GXT TGG AGC GAT GAA CAC TGC TTG AAG TTT TTG AAG AAA TGT TAT 
His AspvTrp Ser Asp Glu His Cys Leu Lys Phe Leu Lys Lys Cys Tyr 



GAA GCA C 
Glu Ala 



ACC AAT GGG AAG GTG ATC CTT GCT GAA TGC ATC CTC 
Pro Thr Asn Gly Lys Val lie Leu Ala Glu Cys He Leu 



CCC GTG GCG CCA\GAC GCA AGC CTC CCC ACT AAG GCA GTG GTC CAT ATT 

Pro Val Ala Pro Asp Ala Ser Leu Pro Thr Lys Ala Val Val His He 

GAT GTC ATC ATG TTG\GCT CAT AAC CCA GGT GGG AAA GAG AGA ACT GAG 

Asp val He Met Leu )t^a His Asn Pro Gly Gly Lys Glu Arg Thr Glu 

AAG GAG TTT GAG GCC TTGNGCC AAG GGG GCT GGA TTT GAA GGT TTC CGA 

Lys Glu Phe Glu Ala Leu Ala Lys Gly Ala Gly Phe Glu Gly Phe Arg 



1020 



1068 



GTA GTA GCC TCG TGC GCT TAC 
Val Val Ala Ser Cys Ala Tyr 

AAG ATT TGAGTCCTTA CTCGGCTTTG 
Lys He 



VT ACA TGG ATC ATC GAA TTT TTG AAG 
Thr Trp He He Glu Phe Leu Lys 



CATAAT ACCAACTCCT TTTGGTTTTC 



GAGATTGTGA TTGTGATTGT GATTGTCTCT 
TCGTTAACTC GATCACAGAA GTGCAAAAGA CAGTG, 
AATTTTAAGA TTTTGATTCA TGTAAAAAAA 



TGGCCTTATG ATATAATGTA 
T ACACTGCTTT ATAAAATAAA 




1280 
1340 
1380 
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Information for Sequence ID #: 
Correspondence Address : 



Addressee" 
Street: 
City: 
Country : 
Zip : 

Computer Readable Form: 
Medium Type: 
Operating Syst* 
Software : 

Current Application Data; 
Attorney Information: 
Name : 

Registration Number: 
Reference/Docket Number N 
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Telefax: 
Information for Seq ID No: 
Sequence Characteristics : 
Length : 
Type: 

Strandedness : 
Topology : 
Molecule Type : 
Hypothetical : 
Anti-Sense : 
Fragment Type: 
Original Source: 
Organism: 
Strain: 

Individual Isolate: 
Developmental State : 
Haplotype : 
Tissue Type: 
Cell Type: 
Cell Line: 
Organelle : 
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Luedeka, Neely & Graham 

P.O. Box 1871 

Knoxville, TN 37901 

Mark S . Graham 

P.O. Box 1871 

Knoxville 

USA 

37901 

1.44 
DOS 
ASCII 
N/A 
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(423) 523-4478 
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Dm 
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N/A 
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kCGAGC TCATTTTCCA CTTCTGGTTT GATCTCTGCA ATTCTTCCAT CAGTCCCTA 

ATG GA& ACC CAA ACA AAA CAA GAA GAA ATC ATA TAT CGG TCG AAA 
Met Glu\hr Gin Thr Lys Gin Glu Glu lie lie Tyr Arg Ser Lys 

CTC CCC GAtNaTC TAC ATC CCC AAA CAC CTC CCT TTA CAT TCG TAT 
Leu Pro Asp rie Tyr He Pro Lys His Leu Pro Leu His Ser Tyr 

TGT TTC GAG AAC ATC TCA CAG TTC GGC TCC CGC CCC TGT CTG ATC 
Cys Phe Glu Asn ire Ser Gin Phe Gly Ser Arg Pro Cys Leu He 

AAT GGC GCA ACG GGC AAG TAT TAC ACA TAT GCT GAG GTT GAG CTC 
Asn Gly Ala Thr Gly Lys\Tyr Tyr Thr Tyr Ala Glu Val Glu Leu 

ATT GCG CGC AAG GTC GCA TCTL GGC CTC AAC AAA CTC GGC GTT CGA 
He Ala Arg Lys Val Ala Ser <S,ly Leu Asn Lys Leu Gly Val Arg 

CAA GGT GAC ATC ATC ATG CTT TTG^TA CCC AAC TCG CCG GAG TTC 
Gin Gly Asp He He Met Leu Leu Leu Pro Asn Ser Pro Glu Phe 



GTG TTT TCA ATT CTC GGC GCA TCC TAC 
Val Phe Ser He Leu Gly Ala Ser Tyr 



3C GGG GCT GCC GCC ACC 
Gly Ala Ala Ala Thr 



GCC GCA AAC CCG TTT TAT ACC CCT GCC GAG 
Ala Ala Asn Pro Phe Tyr Thr Pro Ala Glu 



AGG AAG CAA GCC 
Arg Lys Gin Ala 



AAA ACC TCC AAC GCC AGG CTT ATT ATC ACA CAT GTC TGT TAC TAT 
Lys Thr Ser Asn Ala Arg Leu He He Thr His Ala^Cys Tyr Tyr 

GAG AAA GTG AAG GAC TTG GTG GAA GAG AAC GTT GCC AAG ATC ATA 
Glu Lys Val Lys Asp Leu Val Glu Glu Asn Val Ala Lys\Ile He 

TGT ATA GAC TCA CCC CCG GAC GGT TGT TTG CAC TTC TCG GAG CTG 
Cys He Asp Ser Pro Pro Asp Gly Cys Leu His Phe Ser Gl\ Leu 

AGT GAG GCG GAC GAG AAC GAC ATG CCC AAT GTA GAG ATT GAC 
Ser Glu Ala Asp Glu Asn Asp Met Pro Asn Val Glu He Asp Pr 

GAT GAT GTG GTG GCG CTG CCG TAC TCG TCA GGG ACG ACG GGT TTA 
Asp Asp Val Val Ala Leu Pro Tyr Ser Ser Gly Thr Thr Gly Leu 

CCA AAG GGG GTG ATG CTA ACA CAC AAG GGA CAA GTG ACG AGT GTG 
Pro Lys Gly Val Met Leu Thr His Lys Gly Gin Val Thr Ser Val 

GCG CAA CAG GTG GAC GGA GAG AAT CCG AAC CTG TAT ATA CAT AGC 
Ala Gin Gin Val Asp Gly Glu Asn Pro Asn Leu Tyr He His Ser 

GAG GAC GTG GTT CTG TGC GTG TTG CCT CTG TTT CAC ATC TAC TCG 
Glu Asp Val Val Leu Cys Val Leu Pro Leu Phe His He Tyr Ser 

ATG AAC GTC ATG TTT TGC GGG TTA CGA GTT GGT GCG GCG ATT CTG 
Met Asn Val Met Phe Cys Gly Leu Arg Val Gly Ala Ala He Leu 

ATT ATG CAG AAA TTT GAA ATA TAT GGG TTG TTA GAG CTG GTC AG A 
He Met Gin Lys Phe Glu He Tyr Gly Leu Leu Glu Leu Val Arg 



60 
105 

150 

195 

240 

285 

330 

375 

420 

465 

510 

555 

600 

645 



825 



870 



AGT ACA GGT GAC CAT CAT GCC TAT CGT ACA CCC ATC GTA TTG GCA 



915 



40 



Se\Thr Gly Asp His His Ala Tyr Arg Thr Pro lie Val Leu Ala 



ATC TCC AAG ACT CCG GAT CTT CAC AAC TAT GAT 
lie Se\ Lys Thr Pro Asp Leu His Asn Tyr Asp 

CGG ACT G^C ATG TCA GGT GCG GCT CCT CTG GGC 
Arg Thr VaJvMet Ser Gly Ala Ala Pro Leu Gly 

GAT TCT GTC AGA GCT AAG TTT CCC ACC GCC AAA 
Asp Ser Val ArgN^la Lys Phe Pro Thr Ala Lys 

TAT GGA ATG ACG GA&s^CA GGG CCC GTG CTA GCG 
Tyr Gly Met Thr Glu Ala Gly Pro Val Leu Ala 



TTT GCC AAG GAA GGG TTT 
Phe Ala Lys Glu Gly Phe 



ATA AAA TCG GGG 
lie Lys Ser Gly 



GTT TTA AGG AAC GCA CAG ATG A^G ATT GTG GAC 
Val Leu Arg Asn Ala Gin Met Lys^Ile Val Asp 

GTC ACT CTC CCT CGA AAC CAA CCC G&A GAG ATT 
Val Thr Leu Pro Arg Asn Gin Pro GlyvGlu lie 

GAC CAA ATC ATG AAA GGT TAT CTT AAT GAT CCT 
Asp Gin lie Met Lys Gly Tyr Leu Asn Asp\Pro 

AGA ACC ATA GAC AAG GAA GGT TGG TTA CAC AC 
Arg Thr lie Asp Lys Glu Gly Trp Leu His 

TAC ATC GAC GAT GAC ACT GAG CTC TTC ATT GTT 
Tyr lie Asp Asp Asp Thr Glu Leu Phe lie Val 

GAA CTG ATC AAA TAC AAA GGG TTT CAG GTG GCA 
Glu Leu lie Lys Tyr Lys Gly Phe Gin Val Ala 

GAG GCC ATG CTC CTC AAC CAT CCC AAC ATC TCT 
Glu Ala Met Leu Leu Asn His Pro Asn lie Ser 



GTG TCC TC ATT 
Val Ser Ser lie 

AAG GAA CT GAA 
Lys Glu Leu Glu 

CTT GGT CA^SGA 
Leu Gly Gin Gly 

ATG TGT TTG CA 
Met Cys Leu Ala 

GCA TCT GGA AT 
Ala Ser Gly Thr 

CCT GAA ACC GG 
Pro Glu Thr Gly 

TGC ATT AGA GGA 
Cys lie Arg Gly 

GAG GCG ACG GAG 
Glu Ala Thr Glu 

GGT GAT GTG GGC 
VGly Asp Val Gly 

GA* CGG TTG AAG 
AspN^rg Leu Lys 

CCC GC* GAG CTT 
Pro Ala X31u Leu 

GAT GCT GCOy GTC 
Asp Ala Ala 



960 



1005 



1050 



1095 



1140 



1185 



1230 



1275 



1320 



1365 



1410 



1455 



GTC CCA ATG AAA GAC GAT GAA GCT GGA GAG CTC 
Val Pro Met Lys Asp Asp Glu Ala Gly Glu Leu 

GTT GTA AGA TCA GAT GGT TCT CAG ATA TCC GAG 
Val Val Arg Ser Asp Gly Ser Gin lie Ser Glu 



CCT GTG GCG 
Pro Val Ala Phe > 

GCT GAA ATC AGG 
Ala Glu He Arg 



1500 



1545 



CAA TAC ATC GCA AAA CAG GTG GTT TTT TAT AAA 
Gin Tyr He Ala Lys Gin Val Val Phe Tyr Lys 

GTA TTT TTC GTC GAA GCC ATT CCT AAA GCG CCC 
Val Phe Phe Val Glu Ala He Pro Lys Ala Pro 



AGA ATA CAT CGC 
Arg He His Arg 

TCT GGC AAA ATC 
Ser Gly Lys He 



1590 



TTG CGG AAG GAC CTG AGA GCC AAA TTG GCG TCT 
Leu Arg Lys Asp Leu Arg Ala Lys Leu Ala Ser 

TAATTCTCAT TCGCTACCCT CCTTTCTCTT ATCATACGCC 

ATTAAACGCT GCTCATTCGA AGCGG CTCAA TTAAAGCTGC 



GGT CTT CCC AAT 1680\ 
Gly Leu Pro Asn 

AACACGAACG AAGAGGCTCA 174 0 

TCATTCATGT CCACCGAGTG 1800 



41 



42 



:ion for Sequence ID # : 
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Type: 

Strandedness : 
Topology: 
Molecule Type: 
Hypothetical : 
Anti-Sense : 
Fragment Type: 
Original Source : 
Organism: 
Strain: 

Individual Isolate: 
Developmental State: 
Haplotype : 
Tissue Type: 
Cell Type: 
Cell Line: 
Organelle : 



5 (PAL) 

Luedeka, Neely & Graham 

P.O. Box 1871 

Knoxville, TN 37901 

Mark S . Graham 

P.O. Box 1871 

Knoxville 

USA 

37901 

1.44 
DOS 
ASCII 
N/A 



Mark S 
32,355 
50617.00 



Graham 



(423) 546-4305 
v(423) 523-4478 



1544 
DNA 
double 
linear 
cDNA 

NO 
NO 
N/A 



Pinus taeda 
Wild Type 
N/A 

sporophyte 

N/A 

xylem 

parenchyma 

N/A 

N/A 



(L.) 
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AAA^ATAATA TATGTGTATG CCTACTACTA 


CACATTGTTT 


TGAAGTGTGT 


AAACATAGTG 


60 


CAACACTJ^G AGGACTCACA ATGAGCACTT 


GTTGACATGA 


AACTAGCTAA 


ATGCCCAACA 


120 


atattagtgaSaagctagtta aactaacccc 


TTTGACTTTC 


AAGATGATAT 


ATTT AT AT CC 


180 


CTACTACGTC TTOCTCTTTT TGTCTTTCTC 


TTGTGATTAA 


ACCTTCCTTG 


AAACAATTCT 


240 


CAAATGTAAA ATTAA^CCTT GAAACTTGTA 


GAGACCAAAC 


TTCCCTAGGA 


GAAACCACAT 


300 


TTATGACAAC ATATATACAC CAACCCATTG 


CATACTATAA 


TATTGGAATT 


ACCTGCAGCG 


360 


AACGAAAGAA ACGCTGTCTC A<^CAACTCGT 


GCACTACATC 


CCGAAACTTA 


ACCTTCCCCT 


420 


GATACAGATT GAAGAGCCGA AAAA^GCGTG 


CATCCAAATT 


TCTGGTATGG 


TGAGGAGCCG 


480 


AAAAACGCGT GCGCCTAATT TTTTTGAGAT 


GGGCCGGAAA 


ATAATGCGTG 


CATCTAAATT 


540 


TTCACGTGTC GCGTATTGGC GAGGTTGCGG^ 


TGAATGTGAT 


CCTGTGCGTG 


AGCCACATTC 


600 


ATTCCATTGG TTGACCCGCC GGTACCGCGA 


G&JACCGTGGG 


GTCTCACAGA 


TACGCGGATG 


660 


GTGGATCAGC ACTGAGAAGA TTAGATGATG 


ACCAGGCGGG 


CATTTGAAGT 


AAAAACTTGG 


720 


GGGTGGTTGG CAAGTACGCG ACAAAGAGGG 


GTAGTGOGCA 


AGGAAGCGAG 


TTGGATGCAA 


780 


ATAATATTAC AAAGTGGGTT GGTGGGCATG 


AGCATCAA02 


AGAATGATGT 


TGTTGCTGGT 


840 


TCCGTGCAAA TTCTGACCAG TAGTTTGAAC 


AATACTACCC 


AACTTGTTTT 


TGGTAAAACA 


900 


TGAAGTGGGT AAGGAGAATT GAACTTACGT 


CTCATGGTAA 


aggHcaaggg 


CAAATGACTT 


960 


AACACATACC TTTAACTAAT AAAAATACCC 


CTAACAAATA 


CGAAAACGAA 


TGAGTTATCA 


1020 


CAGACCTTCA ACTAATAAGA TAGCCATCAG 


ACCCACATCT 


CCTGACTGkc 


CAAAAACAAA 


1080 


TGACTTCAAC CAACTAAGAT ACCCATCAAA 


GCTAACCCAC 


AACCCAATTC 


xrrcACTTCcc 


1140 


CTTACCAGAC CAACCAAGCA GACCTACGCC 


ATTAACTACT 


TTAGGACGTG 


GGAATTGGGG 


1200 


GTG CCACCGT TGAAGAATGG CACTCAGGGT 


TGG TAATCCC 


TCCACGTGTA 


TGTAGCAGTC 


1260 


GTTTGGTGGA GACGGCGTGT TTGAATGTCC 


ACCTTCCAGT 


TTGGAGAACA 


AGGAAATteG 


1320 


GCTTATATTA GGCCTGGATC TCTTGTTTCA 


GAGCAGGAGT 


AGTTCAGGAC 


AGGAACTAGCV 


1380 


ATTCAAGAAT TCAATTGCCC TGCCCTGCTC 


TGCTCTGCTT 


TGCTCAACTT 


ATTGATCCCT 


\ 1440 


GCTCTGGTTT GTTCAATTTC TTGACCCCTG 


CTGGGTTCTG 


CTCTGGTTTG 


CACACTTTCT 


IS500 


CGATTATATA AGTCATTTTG GATCCTTGCA 


AGGAAGAGAA 


TATG 




15^4 
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Inf 
Corrfc 



for Sequence ID # 
spondence Address : 



Addressee 
Street : 
City: 
Country 
Zip: 

Computer ReadabT 
Medium Type 
Operating Syst< 
Software 

Current Applicat 
Attorney Information: 
Name : 

Registration Number: 
Reference/Docket Numbe 
Telecommunication Info 
Telephone : 
Telefax: 
Information for Seq ID No: 
Sequence Characteristics: 
Length : 
Type : 

Strandedness : 
Topology : 
Molecule Type: 
Hypothetical : 
Anti-Sense : 
Fragment Type : 
Original Source : 
Organism: 
Strain: 

Individual Isolate: 
Developmental State: 
Haplotype : 
Tissue Type: 
Cell Type: 
Cell Line: 
Organelle : 



Data : 



6 (4CL1B) 

Luedeka, Neely & Graham 

P.O. Box 1871 

Knoxville, TN 37901 

Mark S . Graham 

P.O. Box 1871 

Knoxville 

USA 

37901 

1 .44 
DOS 
ASCII 
N/A 



Graham 



Mark S 
32, 355 
50617.00 



(423) 546-4305 
(423) 523-4478 




Pinus taeda 
Wild Type 
N/A 

sporophyte 

N/A 

xylem 

parenchyma 

N/A 

N/A 



tCACCAAT 
TTATTGTAAT 
CCGAAAA^SAG 
CGGGTGTTGG 
AATGGAGTTT 
CAAAAATCCA 
CACTCAATCG 
ACCAACAATT 
AGCCGGCCTC 
TACATTTGTC 
GTTCGGATTG 



TTAATGGGAT TTCAGATTTG TATCCCATGC TATTGGCTAA GGCATTTTTC 
CTAACCAATT CTAATTTCCA CCCTGGTGTG AACTGACTGA CAAATGCGGT 
CGAATGAAAT GTCTGGGTGA TCGGTCAAAC AAGCGGTGGG CGAGAGAGCG 
ICTAGCCGGG ATGGGGGTAG GTAGACGGCG TATTACCGGC GAGTTGTCCG 
TC^SGTAGG TAGTAACGTA GACGTCAATG GAAAAAGTCA TAATCTCCGT 
ACCGC\CCTT CACATCGCAG AGTTGGTGGC CACGGGACCC TCCACCCACT 
ATCGCCTd^C GTGGTTGCCC ATT ATT C AAC CATACGCGAC TTGACTCTTC 
CCAGGCCGGC VTTCTATACA ATGTACTGCA CAGGAAAATC CAATATAAAA 
TGCTTCCTTC TCAGTAGCCC CCAGCTCATT CAATTCTTCC CACTGCAGGC 
AGACACGTTT TCCGCCATTT TTCGCCTGTT TCTGCGGAGA ATTTGATCAG 
GGATTGAATC AATTGAAAGG TTTTTATTTT CAGTATTTCG ATCGCCATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
659 




46 



J 



Ihf ormation for Sequence ID #: 
Correspondence Address : 



Addressee : 
Street : 
City: 
Country : 
Zip: 

Computer Readable Form: 
Medium TypeN 
Operating System: 
Software : 

Current Application Data 
Attorney Information: 
Name : 

Registration Number: 
Reference /Docket Numbe\: 
Telecommunication Infoi 
Telephone : 
Telefax: 
Information for Seq ID No: 
Sequence Characteristics: 
Length : 
Type: 

Strandedness : 
Topology : 
Molecule Type: 
Hypothetical : 
Anti-Sense: 
Fragment Type: 
Original Source : 
Organism : 
Strain: 

Individual Isolate: 
Developmental State: 
Haplotype : 
Tissue Type: 
Cell Type: 
Cell Line: 
Organelle : 



.tion: 



7 (4CL3B) 

Luedeka, Neely & Graham 

P.O. Box 1871 

Knoxville, TN 37901 

Mark S. Graham 

P.O. Box 1871 

Knoxville 

USA 

37901 

1.44 
DOS 
ASCII 
N/A 

Mark S. Graham 

32,355 

50617.00 

(423) 546-4305 
(423) 523-4478 



*251 
DS 

double 
lines 
cDNA 
No 
No 
N/A 

Pinus taeda 
Wild Type 
N/A 

sporophyte 

N/A 

xylem 

parenchyma 

N/A 

N/A 
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ggScgggtgg 

AAAGT&fcAACA 
AAACC 
CAACCTCCT 
AAAATATTAT 
TCAAAACTTT 
CACAAAAATT 
GTCTCATTAA 
AATAAGGGTG 
ATGGAGTTTT 
TGGAAAAGGT 
AGATGTTAAA 
ATAAAAAATT 
TATTATTTTT 
AAATTAATTT 
TAAATCTATT 
CAATTTGTAT 
TATGATTTCA 
ATTAAGATCT 
ATTGTATCCA 
CAAAATCATT 
AGAAATATAG 
ACATTAAAGC 
TTCTCTCAAT 
TACCCAATAA 
TTTATTGGAA 
AGATACTAAA 
GATTTGTATC 



TGACATTTAT 
AAATTTTCAT 
TATAAAGAAT 
CAACAAAATT 
A&AAAATTTA 
AAAA^AAAGC 
TTGAAAA^AT 
CTCATTAG 
TTTTAATAAG 
TAAAAATATA 
TGGTAAGAAC 
TTTATATATG 
GTTTTGTTAA 
AAAAAATTTG 
TAAATTAATA 
TTGCATTCAA 
AAAAACCAAA 
AGAAAGACAA 
CATTAATTAA 
AGAAATATAG 
ACATTAAAGC 
AATGTTCTCG 
TCATCATGTC 
CTCCCAAAAT 
TATATTTTTT 
TGAAGGTTGA 
TCCATTATAT 
CCATGCTATT 



TCATAAATTC 
CTTTAACATA 
TTCTTTCAAC 
AAAATAGATT 
TTAAAACTTC 
TAAACACTGA 
AAACAAACTT 
TTATAGTTCG 
TTTTGGG 
TATATATATA 
TATAAA* 
TAATTAAAAT 
ATTTAGAGTA 
TTGGTAAATT 
AACTTTTGAA 
AATACAATTT 
AATCTCAAAT 
TAACCAGTTT 
TTCTTATTTT 
AATGTTCTCG 
TCATCATGTC 
TCTAGGGACT 
ATTTGTGGAT 
ATAGTTCGAA 
TATACATTTT 
GTTATAAACT 
AATAAAAACA 
GGCTAAGGCA 



ATCTCAAAAC 
ATTATAATTG 
AATACACTTT 
AATAAATAAA 
AAAATAAACA 
AAATGTGAGT 
GAAACTCTAC 
AATCCAATTA 
ATTTTTTTAG 
TATATTTTTG 
GTTGTGAATG 
'ATTTTGA 
kTTTCAA 
TTATCTTATA 
GTCAAATA 
AAATAATAAA 
AAAATTTAAA 
CCAATAAAAT 
TTAATTTTTT 
TCCAGGGACT 
ATTTGTGGAT 
ATTAATTTCC 
TGGAAATTAG 
CT C CAT ATTT 
AGAGATTTTC 
TTCAGTAATC 
CATTTTAAAC 
TTTTTCTTAT 



AAGAAGGATT 
TGTTCACAAA 
AAT CACAACT 
TAAACTTAAC 
AACTTTTTAT 
ACATTTAAAA 
CTTTTAAGAA 
ACGTATCTTT 
TAATTTATTT 
GGTTGAGTTT 
AGTGTTTTAT 
ATAACAAAAA 
AATCTAAAAT 
TTTAAGTTAA 
CCAAATATTT 
'CATGGA 
TTACjAAAACA 



TACAT 
ATTAATCTCC 
TGGAAATTAT 
AAACAAATTT 
ACAAAAAAAA 
TTGGAAATTG 
CAGACATATT 
CAAGTATCTT 
ACCAATTTAA 
TGTAATCTAA 



TACAAAAATA 
ATTCAAACTT 
TCTTCAATCA 
TATTTAAAAA 
ACAAAATTCA 
GGACGCTGAT 
TGAGTTTGTC 
TATTTTATGG 
GTGATATGTT 
ACTTAAAATT 
GGATTTTTTA 
TTATAATTGG 
AATTAAACAC 
AATTTAGAAA 
TCCAAAATAT 
ATAGATTAAC 
TTATCAACAT 
ATGGCCCGTA 
ATATCTTTAT 
.CAAGTTT 
ATT<^ATAAG 
CAAAATCATT 
TCCCAAATM 
AGAATTTTTT 
TGCTCTGGGA 
CGGTTTTTGA 
TGGGATTTCA 
CCAATTCTAA 
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TTTO 
GGGTGATi 
GGGTAGGTAG 
AACGTAGACG 
TCGCAGAGTT 
CCATTATTCA 
CAATGTACTG 
CCCCAGCTCA 
TTTTCGCCTG 
GGTTTTTATT 



GGTGTGAACT 



TCAAACAAGC 



GGTGGCCACG 
ACCATACGCC 
CACAGGAAAA 
TTCAATTCTT 
TTTCTGCGGA 
TTCAGTATTT 



GACTGACAAA 
GGTGGGCGAG 
ACCGGCGAGT 
AAGTCATAAT 
^CCCTCCA 
ACTTGASTCT 
TCCAATATAA 
CCCACTGCAG 
GAATTTGATC 
CGATCGCCAT 



TGCGGTCCGA 
AGAGCGCGGG 
TGTCCGAATG 
CTCCGTCAAA 
CCCACTCACT 
TCACCAACAA 
ICCGGCC 




AAACAGCGAA 
TGTTGGCCTA 
GAGTTTTCGG 
AATCCAACCG 
CGATCGCCTG 
TTCCAGGCCG 
TCTGCTTCCT 
TCAGACACGT 
TGGGATTGAA 



TGAAATGTCT 
GCCGGGATGG 
GGTAGGTAGT 
CTCCTTCACA 
CCGTGGTTGC 
GCTTTCTATA 
TCTCAGTAGC 
TTTCCGCCAT 
TCAATTGAAA 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2251 
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SEQUENCE LISTING 

<1 10> CHIANG. VINCENT L. 

CARRAWAY. DANIEL T. 

SMELTZER. RICHARD H. 
<120> PRODUCTION OF SYRINGYL LIGNIN IN GYMNOSPERMS 
<130> 044463-0336 
<140> 10/681.878 
<141> 2003-10-09 
<150> 09/796.256 
<151> 2001-02-28 
<150> 08/991.677 
<151> 1997-12-16 
<150> 60/033.381 
<151> 1996-12-16 
<160> 24 

<170> Patentln Ver. 3.3 
<210> 1 
<2U> 1708 
<212>DNA 

<213> Liquidambar styraciflua 

<220> 

<221>CDS 

<222> (48)..(157n 

<400> 1 

cggcacgagg aaaccctaaa actcacctct cttacccttt ctcttca atg get ttc 56 

50 

WASH_1 634660.1 



Met Ala Phe 

1 

ctt eta ata ccc ate tea ata ate ttc ate etc tta get tac cag etc 104 
Leu Leu He Pro He Ser lie lie Phe He Val Leu Ala Tvr Gin Leu 

5 10 15 

tat caa egg etc aga ttt aag etc cca ccc ggc cca cat cca tgg ccg 1 52 
Tvr Gin Arg Leu Arg Phe Lvs Leu Pro Pro Gly Pro Arg Pro Trp Pro 

20 25 30 35 

ate gtc gga aac ctt tac gac ata aaa ccg gtg agg ttc egg tgt ttc 200 
He Val Glv Asn Leu Tvr Asp He Lvs Pro Val Arg Phe Arg Cvs Phe 
40 45 50 

gee gag tgg tea caa gcg tac ggt ccg ate ata teg gtg tgg ttc ggt 248 
Ala Glu Trp Ser Gin Ala Tvr Glv Pro He He Ser Val Trp Phe Glv 

55 60 65 

tea acg ttg aat gtg ate gta teg aat teg gaa ttg get aag gaa gtg 296 
Ser Thr Leu Asn Val He Val Ser Asn Ser Glu Leu Ala Lvs Glu Val 

70 75 80 

etc aag gaa aaa gat caa caa ttg get gat agg cat agg agt aga tea 344 
Leu Lvs Glu Lvs Asp Gin Gin Leu Ala Asp Arg His Arg Ser Arg Ser 

85 90 95 

get gee aaa ttt age agg gat ggg cag gac ctt ata tgg get gat tat 392 
Ala Ala Lvs Phe Ser Arg Asp Glv Gin Asp Leu He Trp Ala Asp Tvr 

100 105 110 115 

gga cct cac tat gtg aag gtt aca aag gtt tgt acc etc gag ctt ttt 440 
Glv Pro His Tvr Val Lvs Val Thr Lvs Val Cvs Thr Leu Glu Leu Phe 

51 
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120 125 130 



act cca aag egg ctt gaa act ctt aga ccc att aga eaa gat gaa gtt 488 
Thr Pro Lvs Arg Leu Glu Ala Leu Arg Pro He Arg Glu Asp Glu Val 

135 140 145 

aca gec atg gtt gag tec att ttt aat gac act gcg aat cct gaa aat 536 
Thr Ala Met Val Glu Ser He Phe Asn Asp Thr Ala Asn Pro Glu Asn 

150 155 160 

tat ggg aag agt atg ctg gtg aag aag tat ttg gga gca gta gca ttc 584 
Tvr Glv Lvs Ser Met Leu Val Lvs Lvs Tvr Leu Glv Ala Val Ala Phe 

165 170 175 

aac aac att aca aga etc gca ttt gga aag cga ttc gtg aat tea gag 632 
Asn Asn He Thr Arg Leu Ala Phe Glv Lvs Arg Phe Val Asn Ser Glu 

180 185 190 195 

ggt gta atg gac gag caa gga ctt gaa ttt aag gaa att gtg gec aat 680 
Glv Val Met Asp Glu Gin Glv Leu Glu Phe Lvs Glu He Val Ala Asn 

200 205 210 

gga etc aag ctt ggt gec tea ctt gca atg get gag cac att cct tgg 728 
Glv Leu Lvs Leu Glv Ala Ser Leu Ala Met Ala Glu His He Pro Trp 

215 220 225 

etc cgt tgg atg ttc cca ctt gag gaa ggg gec ttt gec aag cat ggg 776 
Leu Arg Trp Met Phe Pro Leu Glu Glu Glv Ala Phe Ala Lvs His Glv 

230 235 240 

gca cgt agg gac cga ctt acc aga get ate atg gaa gag cac aca ata 824 
Ala Arg Arg Asp Arg Leu Thr Arg Ala lie Met Glu Glu His Thr He 
245 250 255 

52 
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gcc cgt aaa aag agt ggt gga gcc caa caa cat ttc gtg gat gca ttg 872 
Ala Arg Lvs Lvs Ser Glv Glv Ala Gin Gin His Phe Val Asp Ala Leu 

260 265 270 275 

etc acc eta caa gag aaa tat gac ctt age gag gac act att att ggg 920 
Leu Thr Leu Gin Glu Lvs Tyr Asp Leu Ser Glu Asp Thr He He Glv 

280 285 290 

etc ctt tgg gat atg ate act gca ggc atg gac aca acc gca ate tct 968 
Leu Leu Trp Asp Met He Thr Ala Glv Met Asp Thr Thr Ala He Ser 

295 300 305 

gtc gaa tgg gcc atg gcc gag tta att aag aac cca agg gtg caa caa 1016 
Val Glu Trp Ala Met Ala Glu Leu He Lvs Asn Pro Arg Val Gin Gin 

310 315 320 

aaa get caa gag gag eta gac aat gta ctt ggg tec gaa cgt gtc ctg 1064 
Lvs Ala Gin Glu Glu Leu Asp Asn Val Leu Glv Ser Glu Arg Val Leu 

325 330 335 

acc gaa ttg gac ttc tea age etc cct tat eta caa tgt gta gcc aag 1112 
Thr Glu Leu Asp Phe Ser Ser Leu Pro Tvr Leu Gin Cvs Val Ala Lvs 

340 345 350 355 

gag gca eta agg ctg cac cct cca aca cca eta atg etc cct cat cgc 1 160 
Glu Ala Leu Arg Leu His Pro Pro Thr Pro Leu Met Leu Pro His Arg 
360 365 370 

gcc aat gcc aac gtc aaa att ggt ggc tac gac ate cct aag gga tea 1208 
Ala Asn Ala Asn Val Lvs He Glv Glv Tvr Asp He Pro Lvs Glv Ser 

375 380 385 

aat gtt cat gta aat gtc tgg gcc gtg get cgt gat cca gca gtg tgg 1256 
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Asn Val His Val Asn Val Trp Ala Val Ala Arg Asp Pro Ala Val Trp 

390 395 400 

cat gac cca eta gag ttt cga ccg gaa egg ttc tct gaa gac gat etc 1 304 
Arg Asp Pro Leu Glu Phe Arg Pro Glu Arg Phe Ser Glu Asp Asp Val 

405 410 415 

gac atg aaa ggt cac gat tat agg eta ctg ccg ttt ggt gca ggg agg 1 352 
Asp Met Lvs Glv His Asp Tvr Arg Leu Leu Pro Phe Gly Ala Gly Arg 

420 425 430 435 

cgt gtt tgc ccc ggt gca caa ctt ggc ate aat ttg gtc aca tec atg 1400 
Arg Val Cvs Pro Glv Ala Gin Leu Glv He Asn Leu Val Thr Ser Met 

440 445 450 

atg ggt cac eta ttg cac cat ttc tat tgg age cct cct aaa ggt gta 1448 
Met Glv His Leu Leu His His Phe Tvr Trp Ser Pro Pro Lvs Glv Val 

455 460 465 

aaa cca gag gag att gac atg tea gag aat cca gga ttg gtc acc tac 1496 
Lvs Pro Glu Glu He Asp Met Ser Glu Asn Pro Glv Leu Val Thr Tvr 

470 475 480 

atg cga acc ccg gtg caa get gtt ccc act cca agg ctg cct get cac 1 544 
Met Arg Thr Pro Val Gin Ala Val Pro Thr Pro Arg Leu Pro Ala His 

485 490 495 

ttg tac aaa cgt gta get gtg gat atg taattcttag tttgttatta 1 591 
Leu Tvr Lvs Arg Val Ala Val Asp Met 
500 505 

ttcatgetet taaggttttg gactttgaac ttatgatgag atttgtaaaa ttccaaetga 1651 
tcaaatgaag aaaagaccaa ataaaaaggc ttgacgattt aaaaaaaaaa aaaaaaa 1 708 

54 
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<210>2 
<211> 508 
<212> PRT 

<2 1 3> Liquidambar stvraciflua 
<400> 2 

Met Ala Phe Leu Leu He Pro He Ser He lie Phe He Val Leu Ala 
_J 5 10 15 

Tvr Gin Leu Tvr Gin Are Leu Are Phe Lvs Leu Pro Pro Glv Pro Are 

20 25 30 

Pro Trp Pro He Val Glv Asn Leu Tvr Asp He Lvs Pro Val Are Phe 

35 40 45 

Are Cvs Phe Ala Glu Trp Ser Gin Ala Tvr Glv Pro He He Ser Val 

50 55 60 

Trp Phe Glv Ser Thr Leu Asn Val He Val Ser Asn Ser Glu Leu Ala 
.65 Zfi Z5 80 

Lvs Glu Val Leu Lvs Glu Lvs Asp Gin Gin Leu Ala Asp Arg His Arg 
85 90 95 

Ser Arg Ser Ala Ala Lvs Phe Ser Arg Asp Glv Gin Asp Leu He Trp 

100 105 110 

Ala Asp Tvr Glv Pro His Tvr Val Lvs Val Thr Lvs Val Cvs Thr Leu 

115 120 125 

Glu Leu Phe Thr Pro Lys Arg Leu Glu Ala Leu Arg Pro He Arg Glu 

130 135 140 

Asp Glu Val Thr Ala Met Val Glu Ser He Phe Asn Asp Thr Ala Asn 
145 150 155 160 
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Pro Glu Asn Tvr Glv Lvs Ser Met Leu Val Lvs Lvs Tvr Leu Glv Ala 



165 170 



175 



Val Ala Phe Asn Asn He Thr Are Leu Ala Phe Glv Lvs Are Phe Val 

180 185 190 

Asn Ser Glu Glv Val Met Asp Glu Gin Glv Leu Glu Phe Lvs Glu He 

195 200 205 

Val Ala Asn Glv Leu Lvs Leu Glv Ala Ser Leu Ala Met Ala Glu His 

210 215 220 

He Pro Tip Leu Arg Trp Met Phe Pro Leu Glu Glu Glv Ala Phe Ala 

225 230 235 240 

Lvs His Glv Ala Arg Arg Asp Arg Leu Thr Arg Ala He Met Glu Glu 
245 250 255 

His Thr He Ala Arg Lvs Lvs Ser Glv Glv Ala Gin Gin His Phe Val 

260 265 270 

Asp Ala Leu Leu Thr Leu Gin Glu Lvs Tvr Asp Leu Ser Glu Asp Thr 

275 280 285 

He He Glv Leu Leu Trp Asp Met He Thr Ala Glv Met Asp Thr Thr 

290 295 300 

Ala He Ser Val Glu Trp Ala Met Ala Glu Leu He Lvs Asn Pro Arg 
305 310 315 320 

Val Gin Gin Lvs Ala Gin Glu Glu Leu Asp Asn Val Leu Glv Ser Glu 

325 330 335 

Arg Val Leu Thr Glu Leu Asp Phe Ser Ser Leu Pro Tvr Leu Gin Cys 

340 345 350 

Val Ala Lys Glu Ala Leu Arg Leu His Pro Pro Thr Pro Leu Met Leu 
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355 360 365 

Pro His Are Ala Asn Ala Asn Val Lvs He Glv Gly Tyr Asp He Pro 

370 375 380 

Lvs Glv Ser Asn Val His Val Asn Val Trp Ala Val Ala Are Asp Pro 

385 390 395 400 

Ala Val Trp Are Asp Pro Leu Glu Phe Arg Pro Glu Are Phe Ser Glu 
405 410 415 

Asp Asp Val Asp Met Lvs Glv His Asp Tvr Are Leu Leu Pro Phe Glv 

420 425 430 

Ala Glv Are Are Val Cvs Pro Glv Ala Gin Leu Glv lie Asn Leu Val 

435 440 445 

Thr Ser Met Met Glv His Leu Leu His His Phe Tvr Trp Ser Pro Pro 

450 455 460 

Lvs Glv Val Lvs Pro Glu Glu He Asp Met Ser Glu Asn Pro Glv Leu 

465 470 475 480 

Val Thr Tvr Met Arg Thr Pro Val Gin Ala Val Pro Thr Pro Are Leu 

485 490 495 

Pro Ala His Leu Tyr Lvs Arg Val Ala Val Asp Met 

500 505 



<210> 3 
<211> 1883 
<212> DNA 
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<213> Liquidambar stvraciflua 

<220> 

<221> CDS 

<222> (74\.(\606) 

<400> 3 

tecaaacctg cacaaacaaa gagagagaag aagaaaaagg aagagaggag agagagagag 60 
agagagagaa gcc atg gat tct tct ctt cat gaa gcc ttg caa cca eta 1 09 

Met Asp Ser Ser Leu His Glu Ala Leu Gin Pro Leu 

1 5 10 

ccc atg acg ctg ttc ttc att ata cct ttg eta etc tta ttg ggc eta 157 

Pro Met Thr Leu Phe Phe He He Pro Leu Leu Leu Leu Leu Gly Leu 

15 20 25 

gta tct egg ctt cgc cag aga eta cca tac cca cca ggc cca aaa ggc 205 
Val Ser Arg Leu Arg Gin Arg Leu Pro Tvr Pro Pro Glv Pro Lvs Gly 

30 35 40 

tta ccg gtg ate gga aac atg etc atg atg gat caa etc act cac cga 253 
Leu Pro Val He Glv Asn Met Leu Met Met Asp Gin Leu Thr His Arg 

45 50 55 60 

gga etc gcc aaa etc gcc aaa caa tac ggc ggt eta ttc cac etc aag 301 
Glv Leu Ala Lvs Leu Ala Lvs Gin Tvr Glv Glv Leu Phe His Leu Lvs 
65 70 75 

atg gga ttc tta cac atg gtg gcc gtt tec aca ccc gac atg get cgc 349 
Met Glv Phe Leu His Met Val Ala Val Ser Thr Pro Asp Met Ala Arg 
80 85 90 

caa gtc ctt caa gtc caa gac aac ate ttc teg aac egg cca gcc acc 397 
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Gin Val Leu Gin Val Gin Asp Asn He Phe Ser Asn Arg Pro Ala Thr 

95 100 105 

ata gcc ate age tac etc acc tat gac cga gec gac ate ecc ttc get 445 
He Ala He Ser Tvr Leu Thr Tyr Asp Arg Ala Asp Met Ala Phe Ala 

110 115 120 

cac tac ggc ccg ttt tgg cgt cag atg cgt aaa etc tgc gtc atg aaa 493 
His Tvr Glv Pro Phe Trp Arg Gin Met Arg Lvs Leu Cvs Val Met Lvs 

125 130 135 140 

tta ttt age egg aaa cga gee gag teg tgg gag teg gtc cga gac gag 541 
Leu Phe Ser Arg Lvs Arg Ala Glu Ser Trp Glu Ser Val Arg Asp Glu 
145 150 155 

gtc gac teg gca gta cga gtg gtc gcg tec aat att ggg teg acg gtg 589 
Val Asp Ser Ala Val Arg Val Val Ala Ser Asn lie Glv Ser Thr Val 

160 165 170 

aat ate ggc gag ctg gtt ttt get ctg acg aag aat att act tac agg 637 
Asn He Glv Glu Leu Val Phe Ala Leu Thr Lvs Asn He Thr Tvr Arg 

175 180 185 

gcg get ttt ggg acg ate teg cat gag gac cag gac gag ttc gtg gcc 685 
Ala Ala Phe Glv Thr He Ser His Glu Asp Gin Asp Glu Phe Val Ala 

190 195 200 

ata ctg caa gag ttt teg cag ctg ttt ggt get ttt aat ata get gat 733 

He Leu Gin Glu Phe Ser Gin Leu Phe Glv Ala Phe Asn He Ala Asp 

205 210 215 220 

ttt ate cct tgg etc aaa tgg gtt cct cag ggg att aac gtc agg etc 78 1 
Phe He Pro Trp Leu Lvs Trp Val Pro Gin Glv He Asn Val Are Leu 
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225 230 235 



aac aag gca cga ggg gcg ctt gat ggg ttt att gac aag ate ate gac 829 
Asn Lys Ala Arg Gly Ala Leu Asp Gly Phe He Asp Lys He He Asp 

240 245 250 

gat cat ata cag aag ggg agt aaa aac teg gag gag gtt gat act gat 877 
Asp His He Gin Lvs Glv Ser Lvs Asn Ser Glu Glu Val Asp Thr Asp 

255 260 265 

atg gta gat gat tta ctt get ttt tac ggt gag gaa gec aaa gta age 925 
Met Val Asp Asp Leu Leu Ala Phe Tyr Glv Glu Glu Ala Lvs Val Ser 

270 275 280 

gaa tct gac gat ctt caa aat tec ate aaa etc acc aaa gac aac ate 973 
Glu Ser Asp Asp Leu Gin Asn Ser He Lvs Leu Thr Lys Asp Asn He 

285 290 295 300 

aaa get ate atg gac gta atg ttt gga ggg acc gaa acg gtg gcg tec 1021 
Lvs Ala He Met Asp Val Met Phe Glv Glv Thr Glu Thr Val Ala Ser 
305 310 315 

gcg att gaa tgg gec atg acg gag ctg atg aaa age cca gaa gat eta 1069 
Ala He Glu Trp Ala Met Thr Glu Leu Met Lvs Ser Pro Glu Asp Leu 

320 325 330 

aag aag gtc caa caa gaa etc gee gtg gtg gtg ggt ctt gac egg cga 1117 
Lvs Lvs Val Gin Gin Glu Leu Ala Val Val Val Glv Leu Asp Arg Arg 

335 340 345 

etc gaa gag aaa gac ttc gag aag etc acc tac ttg aaa tgc gta ctg 1165 
Val Glu Glu Lvs Asp Phe Glu Lvs Leu Thr Tvr Leu Lvs Cvs Val Leu 
350 355 360 
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aag gaa gtc ctt esc etc cac cca ccc ate cca etc etc etc cac gag 1213 
Lvs Glu Val Leu Arg Leu His Pro Pro He Pro Leu Leu Leu His Glu 

365 370 375 380 

act gee gag gac gee gag gtc ggc ggc tac tac att ccg gcg aaa teg 1261 
Thr Ala Glu Asp Ala Glu Val Glv Glv Tvr Tvr He Pro Ala Lvs Ser 
385 390 395 

egg gtg atg ate aac gcg tgc gee ate ggc egg gac aag aac teg tgg 1309 
Arg Val Met He Asn Ala Cys Ala He Glv Arg Asp Lvs Asn Ser Tip 

400 405 410 

gee gac cca gat acg ttt agg ccc tec agg ttt etc aaa gac ggt gtg 1357 
Ala Asp Pro Asp Thr Phe Arg Pro Ser Arg Phe Leu Lvs Asp Glv Val 

415 420 425 

ccc gat ttc aaa ggg aac aac ttc gag ttc ate cca ttc ggg tea ggt 1405 
Pro Asp Phe Lvs Glv Asn Asn Phe Glu Phe He Pro Phe Glv Ser Glv 

430 435 440 

cgt egg tct tgc ccc ggt atg caa etc gga etc tac gcg eta gag acg 1453 
Arg Arg Ser Cvs Pro Glv Met Gin Leu Glv Leu Tvr Ala Leu Glu Thr 

445 450 455 460 

act gtg get cac etc ctt cac tgt ttc acg tgg gag ttg ccg gac ggg 1501 
Thr Val Ala His Leu Leu His Cvs Phe Thr Trp Glu Leu Pro Asp Glv 
465 470 475 

atg aaa ccg agt gaa etc gag atg aat gat gtg ttt gga etc acc gcg 1 549 
Met Lvs Pro Ser Glu Leu Glu Met Asn Asp Val Phe Glv Leu Thr Ala 

480 485 490 

cca aga gcg att cga etc acc gee gtg ccg agt cca cgc ctt etc tgt 1 597 
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Pro Are Ala He Are Leu Thr Ala Val Pro Ser Pro Are Leu Leu Cys 

495 500 505 

cct etc tat tgatcgaatg attgggggag ctttgtggag gggcttttat 1646 

Pro Leu Tyr 
510 

ggagactcta tatatagatg ggaagtgaaa caacgacagg tgaatgcttg gatttttggt 1706 

atatattggg gagggagggg aaaaaaaaaa taatgaaagg aaagaaaaga gagaatttga 1766 

atttctcttc ctctgtggat aaaagecteg tttttaattg tttttatgtg gagatatttg 1 826 

tgtttgttta tttttatctc tttttttgea ataacactca aaaataaaaa aaaaaaa 1 883 

<210>4 

<211>5U 

<212> PRT 

<213> Liquidambar styraciflua 
<400> 4 

Met Asp Ser Ser Leu His Glu Ala Leu Gin Pro Leu Pro Met Thr Leu 

1 5 10 15 

Phe Phe lie He Pro Leu Leu Leu Leu Leu Glv Leu Val Ser Arg Leu 
20 25 30 

Arg Gin Arg Leu Pro Tyr Pro Pro Glv Pro Lvs Glv Leu Pro Val He 

35 40 45 

Glv Asn Met Leu Met Met Asp Gin Leu Thr His Arg Glv Leu Ala Lvs 

50 55 60 

Leu Ala Lvs Gin Tvr Glv Glv Leu Phe His Leu Lvs Met Glv Phe Leu 
_65 70 7J 80 

His Met Val Ala Val Ser Thr Pro Asp Met Ala Arg Gin Val Leu Gin 

62 

WAS HI 634660.1 



85 90 95 

Val Gin Asp Asn He Phe Ser Asn Are Pro Ala Thr lie Ala He Ser 

100 105 110 

Tvr Leu Thr Tvr Asp Arg Ala Asp Met Ala Phe Ala His Tyr Glv Pro 

115 120 125 

Phe Trp Arg Gin Met Are Lvs Leu Cvs Val Met Lvs Leu Phe Ser Are 

130 135 140 

Lvs Are Ala Glu Ser Trp Glu Ser Val Are Asp Glu Val Asp Ser Ala 

145 150 155 160 

Val Arg Val Val Ala Ser Asn He Glv Ser Thr Val Asn He Glv Glu 
165 170 175 

Leu Val Phe Ala Leu Thr Lvs Asn He Thr Tvr Arg Ala Ala Phe Glv 

180 185 190 

Thr He Ser His Glu Asp Gin Asp Glu Phe Val Ala He Leu Gin Glu 

195 200 205 

Phe Ser Gin Leu Phe Glv Ala Phe Asn He Ala Asp Phe He Pro Trp 

210 215 220 

Leu Lvs Trp Val Pro Gin Glv He Asn Val Arg Leu Asn Lvs Ala Arg 

225 230 235 240 

Glv Ala Leu Asp Glv Phe He Asp Lvs He He Asp Asp His He Gin 
245 250 255 

Lvs Glv Ser Lvs Asn Ser Glu Glu Val Asp Thr Asp Met Val Asp Asp 

260 265 270 

Leu Leu Ala Phe Tvr Glv Glu Glu Ala Lvs Val Ser Glu Ser Asp Asp 
275 280 285 
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Leu Gin Asn Ser He Lvs Leu Thr Lvs Asp Asn He Lvs Ala He Met 

290 295 300 

Asp Val Met Phe Glv Glv Thr Glu Thr Val Ala Ser Ala He Glu Trp 
305 310 315 320 

Ala Met Thr Glu Leu Met Lvs Ser Pro Glu Asp Leu Lvs Lvs Val Gin 
325 330 335 

Gin Glu Leu Ala Val Val Val Glv Leu Asp Are Are Val Glu Glu Lvs 

340 345 350 

Asp Phe Glu Lvs Leu Thr Tvr Leu Lvs Cvs Val Leu Lvs Glu Val Leu 

355 360 365 

Are Leu His Pro Pro He Pro Leu Leu Leu His Glu Thr Ala Glu Asp 

370 375 380 

Ala Glu Val Glv Glv Tvr Tvr lie Pro Ala Lvs Ser Are Val Met He 

385 390 395 400 

Asn Ala Cvs Ala He Glv Are Asp Lvs Asn Ser Trp Ala Asp Pro Asp 
405 4J0 415 

Thr Phe Are Pro Ser Are Phe Leu Lvs Asp Glv Val Pro Asp Phe Lvs 

420 425 430 

Glv Asn Asn Phe Glu Phe He Pro Phe Glv Ser Glv Are Are Ser Cvs 

435 440 445 

Pro Glv Met Gin Leu Glv Leu Tvr Ala Leu Glu Thr Thr Val Ala His 

450 455 460 

Leu Leu His Cvs Phe Thr Trp Glu Leu Pro Asp Glv Met Lvs Pro Ser 

465 470 475 480 

Glu Leu Glu Met Asn Asp Val Phe Glv Leu Thr Ala Pro Are Ala He 
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485 490 495 

Are Leu Thr Ala Val Pro Ser Pro Are Leu Leu Cvs Pro Leu Tvr 
500 505 510 



<210>5 
<2U> 1380 
<212>DNA 

<213> Liquidambar stvraciflua 

<220> 

<221>CDS 

<222> (6T)JWO) 

<400> 5 

cggcacgagc cctacctcct ttcttggaaa aatttcccca ttcgatcaca atccgggcct 60 
caaaaa atg gga tea aca age gaa acg aag atg age ccg agt gaa gca 108 
Met Glv Ser Thr Ser Glu Thr Lvs Met Ser Pro Ser Glu Ala 

1 5 10 

gca gca gca gaa gaa gaa gca ttc eta ttc get atg caa tta ace agt 1 56 
Ala Ala Ala Glu Glu Glu Ala Phe Val Phe Ala Met Gin Leu Thr Ser 

J_5 20 25 30 

get tea gtt ctt ccc atg gtc eta aaa tea gec ata gag etc gac gtc 204 
Ala Ser Val Leu Pro Met Val Leu Lvs Ser Ala He Glu Leu Asp Val 
35 40 45 

tta gaa ate atg get aaa get ggt cca get gcg cac ata tec aca tct 252 
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Leu Glu He Met Ala Lvs Ala Glv Pro Glv Ala His He Ser Thr Ser 

50 55 60 

gac ata gcc tct aag ctg ccc aca aag aat cca gat gca gcc gtc atg 300 
Asp He Ala Ser Lvs Leu Pro Thr Lvs Asn Pro Asp Ala Ala Val Met 

65 70 75 

ctt gac cgt atg etc cgc etc ttg get age tac tct gtt eta acg tgc 348 

Leu Asp Arg Met Leu Arg Leu Leu Ala Ser Tvr Ser Val Leu Thr Cvs 

80 85 90 

tct etc cgc acc etc cct gac ggc aag ate gag agg ctt tac ggc ctt 396 
Ser Leu Arg Thr Leu Pro Asp Glv Lvs He Glu Arg Leu Tvr Glv Leu 

_95 100 105 110 

gca ccc gtt tgt aaa ttc ttg acc aga aac gat gat gga gtc tec ata 444 
Ala Pro Val Cvs Lvs Phe Leu Thr Arg Asn Asp Asp Glv Val Ser He 
U5 120 125 

gcc get ctg tct etc atg aat caa gac aag gtc etc atg gag age tgg 492 
Ala Ala Leu Ser Leu Met Asn Gin Asp Lvs Val Leu Met Glu Ser Trp 

130 135 140 

tac cac ttg acc gag gca gtt ctt gaa ggt gga att cca ttt aac aag 540 
Tvr His Leu Thr Glu Ala Val Leu Glu Glv Glv He Pro Phe Asn Lvs 

145 150 155 

gcc tat gga atg aca gca ttt gag tac cat ggc acc gat ccc aga ttc 588 
Ala Tvr Glv Met Thr Ala Phe Glu Tvr His Glv Thr Asp Pro Arg Phe 

160 165 170 

aac aca gtt ttc aac aat gga atg tec aat cat teg acc att acc atg 636 
Asn Thr Val Phe Asn Asn Glv Met Ser Asn His Ser Thr He Thr Met 
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175 180 185 190 



aag aaa ate ctt gag act tac aaa ggg ttc gag gga ctt gga tct gtg 684 
Lvs Lvs He Leu Glu Thr Tvr Lvs Gly Phe Glu Glv Leu Glv Ser Val 

195 200 205 

gtt gat gtt ggt ggt ggc act ggt gec cac ctt aac atg att ate get 732 
Val Asp Val Glv Glv Glv Thr Glv Ala His Leu Asn Met He He Ala 

210 215 220 

aaa tac ccc atg ate aag ggc att aac ttc gac ttg cct cat gtt att 780 
Lvs Tvr Pro Met He Lvs Glv He Asn Phe Asp Leu Pro His Val He 

225 230 235 

gag gag get ccc tec tat cct ggt gtg gag cat gtt ggt gga gat atg 828 
Glu Glu Ala Pro Ser Tvr Pro Glv Val Glu His Val Glv Glv Asp Met 

240 245 250 

ttt gtt agt gtt cca aaa gga gat gec att ttc atg aag tgg ata tgt 876 
Phe Val Ser Val Pro Lvs Glv Asp Ala He Phe Met Lvs Trp He Cvs 

255 260 265 270 

cat gat tgg age gat gaa cac tgc ttg aag ttt ttg aag aaa tgt tat 924 

His Asp Trp Ser Asp Glu His Cvs Leu Lvs Phe Leu Lvs Lvs Cvs Tvr 

_275 280 285 

gaa gca ctt cca acc aat ggg aag gtg ate ctt get gaa tgc ate etc 972 
Glu Ala Leu Pro Thr Asn Glv Lvs Val He Leu Ala Glu Cvs He Leu 

290 295 300 

ccc gtg gcg cca gac gca age etc ccc act aag gca gtg gtc cat att 1020 
Pro Val Ala Pro Asp Ala Ser Leu Pro Thr Lvs Ala Val Val His He 

305 310 315 
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gat etc ate atg ttg get cat aac cca ggt ggg aaa gag aga act gag 1 068 
Asp Val lie Met Leu Ala His Asn Pro Glv Glv Lvs Glu Are Thr Glu 

320 325 330 

aag gag ttt gag gee ttg gec aag ggg get gga ttt gaa ggt ttc cga 1116 
Lvs Glu Phe Glu Ala Leu Ala Lvs Glv Ala Glv Phe Glu Glv Phe Arg 

335 340 345 350 

gta gta gee teg tgc get tac aat aca tgg ate ate gaa ttt ttg aag 1 1 64 
Val Val Ala Ser Cvs Ala Tyr Asn Thr Trp He lie Glu Phe Leu Lvs 

355 360 365 

aag art tgagtcctta cteggctttg agtacataat accaactcct tttggttttc 1220 
Lvs He 

gagattgtga ttgtgattgt gattgtctct etttegcagt tggccttatg atataatgta 1280 
tegttaaetc gatcacagaa gtgeaaaaga cagtgaatgt acactgettt ataaaataaa 1340 

aattttaaga ttttgattca tgtaaaaaaa aaaaaaaaaa 1380 

<210>6 
<211>368 
<212> PRT 

<213> Liquidambar styraciflua 
<400> 6 

Met Glv Ser Thr Ser Glu Thr Lvs Met Ser Pro Ser Glu Ala Ala Ala 
_1 5 10 L5 

Ala Glu Glu Glu Ala Phe Val Phe Ala Met Gin Leu Thr Ser Ala Ser 

20 25 30 

Val Leu Pro Met Val Leu Lvs Ser Ala He Glu Leu Asp Val Leu Glu 
35 40 45 
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He Met Ala Lvs Ala Glv Pro Glv Ala His He Ser Thr Ser Asp He 

50 55 60 

Ala Ser Lvs Leu Pro Thr Lvs Asn Pro Asp Ala Ala Val Met Leu Asp 
65 70 75 80 

Are Met Leu Are Leu Leu Ala Ser Tyr Ser Val Leu Thr Cvs Ser Leu 
85 90 95 

Are Thr Leu Pro Asp Glv Lvs He Glu Are Leu Tvr Glv Leu Ala Pro 

100 105 110 

Val Cvs Lvs Phe Leu Thr Are Asn Asp Asp Glv Val Ser He Ala Ala 

115 120 125 

Leu Ser Leu Met Asn Gin Asp Lvs Val Leu Met Glu Ser Trp Tyr His 

130 135 140 

Leu Thr Glu Ala Val Leu Glu Glv Glv He Pro Phe Asn Lvs Ala Tvr 
145 150 155 160 

Glv Met Thr Ala Phe Glu Tvr His Glv Thr Asp Pro Are Phe Asn Thr 
165 170 175 

Val Phe Asn Asn Glv Met Ser Asn His Ser Thr He Thr Met Lvs Lvs 

180 185 190 

He Leu Glu Thr Tvr Lvs Glv Phe Glu Glv Leu Glv Ser Val Val Asp 

195 200 205 

Val Glv Glv Glv Thr Glv Ala His Leu Asn Met He He Ala Lvs Tvr 

210 215 220 

Pro Met He Lvs Glv He Asn Phe Asp Leu Pro His Val He Glu Glu 

225 230 235 240 

Ala Pro Ser Tvr Pro Glv Val Glu His Val Glv Glv Asp Met Phe Val 
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245 250 255 

Ser Val Pro Lvs Glv Asp Ala He Phe Met Lvs Trp He Cvs His Asp 

260 265 270 

Trp Ser Asp Glu His Cys Leu Lvs Phe Leu Lvs Lvs Cvs Tvr Glu Ala 

275 280 285 

Leu Pro Thr Asn Glv Lvs Val He Leu Ala Glu Cvs He Leu Pro Val 

290 295 300 

Ala Pro Asp Ala Ser Leu Pro Thr Lvs Ala Val Val His He Asp Val 

305 310 3\5 320 

He Met Leu Ala His Asn Pro Glv Glv Lvs Glu Are Thr Glu Lvs Glu 
325 330 335 

Phe Glu Ala Leu Ala Lvs Glv Ala Glv Phe Glu Glv Phe Are Val Val 

340 345 350 

Ala Ser Cvs Ala Tvr Asn Thr Trp He He Glu Phe Leu Lvs Lvs He 
355 360 365 



<210> 7 
<21 1> 2025 
<212> DNA 

<213> Liquidambar styraciflua 

<220> 

<221>CDS 

<222> (60 s )..(1679'> 
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<400> 7 

cggcacgagc tcattttcca cttctggttt gatctctgca attcttccat cagtcccta 59 
atg gag acc caa aca aaa caa gaa gaa ate ata tat egg teg aaa etc 107 
Met Glu Thr Gin Thr Lvs Gin Glu Glu lie lie Tvr Arg Ser Lvs Leu 

_l 5 10 L5 

ccc gat ate tac ate ccc aaa cac etc cct tta cat teg tat tgt ttc 1 55 
Pro Asp He Tvr He Pro Lvs His Leu Pro Leu His Ser Tvr Cvs Phe 

20 25 30 

gag aac ate tea cag ttc ggc tec cgc ccc tgt ctg ate aat ggc gca 203 
Glu Asn He Ser Gin Phe Glv Ser Arg Pro Cvs Leu He Asn Glv Ala 

35 40 45 

acg ggc aag tat tac aca tat get gag gtt gag etc att gcg cgc aag 251 
Thr Glv Lvs Tvr Tvr Thr Tvr Ala Glu Val Glu Leu He Ala Arg Lvs 

50 55 60 

gtc gca tec ggc etc aac aaa etc ggc gtt cga caa ggt gac ate ate 299 
Val Ala Ser Glv Leu Asn Lvs Leu Glv Val Arg Gin Glv Asp He He 

_65 70 75 80 

atg ctt ttg eta ccc aac teg ccg gag ttc gtg ttt tea att etc ggc 347 
Met Leu Leu Leu Pro Asn Ser Pro Glu Phe Val Phe Ser He Leu Glv 
85 90 95 

gca tec tac cgc ggg get gec gec acc gee gca aac ccg ttt tat acc 395 
Ala Ser Tvr Arg Glv Ala Ala Ala Thr Ala Ala Asn Pro Phe Tvr Thr 

100 105 110 

cct gec gag ate agg aag caa gec aaa acc tec aac gec agg ctt att 443 
Pro Ala Glu He Arg Lvs Gin Ala Lvs Thr Ser Asn Ala Arg Leu He 
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115 120 125 



ate aca cat gec tgt tac tat gag aaa gtg aag gac ttg gtg gaa gag 491 
He Thr His Ala Cvs Tvr Tvr Glu Lvs Val Lvs Asp Leu Val Glu Glu 

130 L35 140 

aac gtt gec aag ate ata tgt ata gac tea ccc ccg gac ggt tgt ttg 539 
Asn Val Ala Lvs He He Cvs He Asp Ser Pro Pro Asp Glv Cvs Leu 

145 150 155 160 

cac ttc teg gag ctg agt gag gcg gac gag aac gac atg ccc aat gta 587 
His Phe Ser Glu Leu Ser Glu Ala Asp Glu Asn Asp Met Pro Asn Val 
165 170 175 

gag art gac ccc gat gat gtg gtg gcg ctg ccg tac teg tea ggg acg 635 
Glu He Asp Pro Asp Asp Val Val Ala Leu Pro Tvr Ser Ser Glv Thr 

180 185 190 

acg ggt tta cca aag ggg gtg atg eta aca cac aag gga caa gtg acg 683 
Thr Glv Leu Pro Lvs Glv Val Met Leu Thr His Lvs Glv Gin Val Thr 

195 200 205 

agt gtg gcg caa cag gtg gac gga gag aat ccg aac ctg tat ata cat 731 
Ser Val Ala Gin Gin Val Asp Glv Glu Asn Pro Asn Leu Tvr He His 

210 215 220 

age gag gac gtg gtt ctg tgc gtg ttg cct ctg ttt cac ate tac teg 779 
Ser Glu Asp Val Val Leu Cvs Val Leu Pro Leu Phe His He Tvr Ser 

225 230 235 240 

atg aac gtc atg ttt tgc ggg tta cga gtt ggt gcg gcg att ctg art 827 
Met Asn Val Met Phe Cvs Glv Leu Arg Val Glv Ala Ala He Leu He 
245 250 255 
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ate cag aaa ttt gaa ata tat ggg ttg tta gag ctg etc aga agt aca 875 
Met Gin Lvs Phe Glu He Tvr Glv Leu Leu Glu Leu Val Are Ser Thr 

260 265 270 

ggt gac cat cat gcc tat cgt aca ccc ate gta ttg gca ate tec aag 923 
Glv Asp His His Ala Tyr Arg Thr Pro He Val Leu Ala He Ser Lvs 

275 280 285 

act ccg gat ctt cac aac tat gat gtg tec tec att egg act gtc atg 971 
Thr Pro Asp Leu His Asn Tvr Asp Val Ser Ser He Arg Thr Val Met 

290 295 300 

tea ggt gcg get cct ctg ggc aag gaa ctt gaa gat tct gtc aga get 1019 
Ser Glv Ala Ala Pro Leu Glv Lvs Glu Leu Glu Asp Ser Val Arg Ala 

305 310 315 320 

aag ttt ccc acc gcc aaa ctt ggt cag gga tat gga atg acg gag gca 1067 
Lvs Phe Pro Thr Ala Lvs Leu Glv Gin Glv Tvr Glv Met Thr Glu Ala 
325 330 335 

ggg ccc gtg eta gcg atg tgt ttg gca ttt gcc aag gaa ggg ttt gaa 1115 
Glv Pro Val Leu Ala Met Cvs Leu Ala Phe Ala Lvs Glu Glv Phe Glu 

340 345 350 

ata aaa teg ggg gca tct gga act gtt tta agg aac gca cag atg aag 1 1 63 
He Lvs Ser Glv Ala Ser Glv Thr Val Leu Arg Asn Ala Gin Met Lvs 

355 360 365 

att gtg gac cct gaa acc ggt gtc act etc cct cga aac caa ccc gga 1211 
He Val Asp Pro Glu Thr Glv Val Thr Leu Pro Arg Asn Gin Pro Glv 

370 375 380 

gag att tgc att aga gga gac caa ate atg aaa ggt tat ctt aat gat 1 259 
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Glu He Cvs He Are Glv Asp Gin He Met Lvs Glv Tvr Leu Asn Asp 

385 390 395 400 

cct gag gcg acg gag aga acc ata gac aag gaa get tgg tta cac aca 1307 
Pro Glu Ala Thr Glu Arg Thr He Asp Lvs Glu Glv Trp Leu His Thr 
405 410 415 

ggt gat gtg ggc tac ate gac gat gac act gag etc ttc att gtt gat 1355 
Glv Asp Val Glv Tvr He Asp Asp Asp Thr Glu Leu Phe He Val Asp 

420 425 430 

egg ttg aag gaa ctg ate aaa tac aaa ggg ttt cag gtg gca ccc get 1403 
Arg Leu Lvs Glu Leu He Lvs Tvr Lvs Glv Phe Gin Val Ala Pro Ala 

435 440. 445 

gag ctt gag gec atg etc etc aac cat ccc aac ate tct gat get gec 1451 
Glu Leu Glu Ala Met Leu Leu Asn His Pro Asn He Ser Asp Ala Ala 

450 455 460 

gtc gtc cca atg aaa gac gat gaa get gga gag etc cct gtg gcg ttt 1499 
Val Val Pro Met Lvs Asp Asp Glu Ala Glv Glu Leu Pro Val Ala Phe 

465 470 475 480 

gtt gta aga tea gat ggt tct cag ata tec gag get gaa ate agg caa 1 547 
Val Val Arg Ser Asp Glv Ser Gin He Ser Glu Ala Glu He Arg Gin 

485 490 495 

tac ate gca aaa cag gtg gtt ttt tat aaa aga ata cat cgc gta ttt 1 595 
Tvr He Ala Lvs Gin Val Val Phe Tvr Lvs Arg He His Arg Val Phe 

500 505 510 

ttc gtc gaa gee att cct aaa gcg ccc tct ggc aaa ate ttg egg aag 1643 
Phe Val Glu Ala He Pro Lvs Ala Pro Ser Glv Lvs He Leu Arg Lvs 
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515 520 525 



gac ctg aga gcc aaa ttg gcg tct ggt ctt ccc aat taattctcat 1689 
Asp Leu Arg Ala Lys Leu Ala Ser Glv Leu Pro Asn 

530 535 540 

tcgctaccct cctttctctt atcatacgcc aacacgaacg aagaggctca attaaacgct 1 749 
gctcattcga agcggctcaa ttaaagctgc tcattcatgt ccaccgagtg ggcagcctgt 1809 
cttgttggga tgttctttca tttgattcag ctgtgagaag ccagaccctc attatttatt 1 869 
gtgaaattca caagaatgtc tgtaaatcga tgttgtgagt gatgggtttc aaaacacttt 1929 
tgacattgtt tacgttgtat ttcctgctgt tgaaaataac tactttgtat gacttttatt 1989 

tgggaagata acctttcaaa aaaaaaaaaa aaaaaa 2025 

<210>8 
<211> 540 
<212> PRT 

<213> Liquidambar stvraciflua 
<400> 8 

Met Glu Thr Gin Thr Lvs Gin Glu Glu He He Tvr Arg Ser Lvs Leu 
_1 5 10 15 

Pro Asp He Tvr He Pro Lvs His Leu Pro Leu His Ser Tvr Cvs Phe 
20 25 30 

Glu Asn He Ser Gin Phe Glv Ser Arg Pro Cvs Leu He Asn Glv Ala 

35 40 45 

Thr Glv Lvs Tvr Tvr Thr Tvr Ala Glu Val Glu Leu He Ala Arg Lvs 

50 55 60 

Val Ala Ser Glv Leu Asn Lvs Leu Glv Val Arg Gin Glv Asp He He 
_65 70 75 80 
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Met Leu Leu Leu Pro Asn Ser Pro Glu Phe Val Phe Ser lie Leu Glv 



85 90 95 

Ala Ser Tvr Are Glv Ala Ala Ala Thr Ala Ala Asn Pro Phe Tvr Thr 

100 105 110 

Pro Ala Glu He Arg Lvs Gin Ala Lvs Thr Ser Asn Ala Are Leu He 

115 120 125 

He Thr His Ala Cvs Tvr Tvr Glu Lvs Val Lvs Asp Leu Val Glu Glu 

130 135 140 

Asn Val Ala Lvs He He Cvs He Asp Ser Pro Pro Asp Glv Cvs Leu 
145 150 155 160 

His Phe Ser Glu Leu Ser Glu Ala Asp Glu Asn Asp Met Pro Asn Val 
165 170 175 

Glu He Asp Pro Asp Asp Val Val Ala Leu Pro Tvr Ser Ser Glv Thr 

180 185 190 

Thr Glv Leu Pro Lvs Glv Val Met Leu Thr His Lvs Glv Gin Val Thr 

195 200 205 

Ser Val Ala Gin Gin Val Asp Glv Glu Asn Pro Asn Leu Tvr He His 

210 215 220 

Ser Glu Asp Val Val Leu Cvs Val Leu Pro Leu Phe His He Tvr Ser 

225 230 235 240 

Met Asn Val Met Phe Cvs Glv Leu Arg Val Glv Ala Ala He Leu He 
245 250 255 

Met Gin Lvs Phe Glu lie Tvr Glv Leu Leu Glu Leu Val Are Ser Thr 

260 265 270 

Glv Asp His His Ala Tvr Arg Thr Pro He Val Leu Ala He Ser Lvs 
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275 280 285 

Thr Pro Asp Leu His Asn Tvr Asp Val Ser Ser lie Are Thr Val Met 

290 295 300 

Ser Glv Ala Ala Pro Leu Gly Lys Glu Leu Glu Asp Ser Val Are Ala 

305 310 315 320 

Lys Phe Pro Thr Ala Lvs Leu Glv Gin Glv Tvr Glv Met Thr Glu Ala 
325 330 335 

Glv Pro Val Leu Ala Met Cvs Leu Ala Phe Ala Lvs Glu Glv Phe Glu 

340 345 350 

He Lvs Ser Glv Ala Ser Glv Thr Val Leu Are Asn Ala Gin Met Lvs 

355 360 365 

He Val Asp Pro Glu Thr Glv Val Thr Leu Pro Are Asn Gin Pro Glv 

370 375 380 

Glu He Cvs He Are Gly Asp Gin He Met Lvs Glv Tvr Leu Asn Asp 

385 390 395 400 

Pro Glu Ala Thr Glu Are Thr He Asp Lvs Glu Glv Trp Leu His Thr 
405 410 415 

Glv Asp Val Glv Tvr He Asp Asp Asp Thr Glu Leu Phe He Val Asp 

420 425 430 

Are Leu Lvs Glu Leu He Lvs Tvr Lys Glv Phe Gin Val Ala Pro Ala 

435 440 445 

Glu Leu Glu Ala Met Leu Leu Asn His Pro Asn He Ser Asp Ala Ala 

450 455 460 

Val Val Pro Met Lvs Asp Asp Glu Ala Glv Glu Leu Pro Val Ala Phe 
465 470 475 480 
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Val Val Arg Ser Asp GIv Ser Gin He Ser Glu Ala GIu He Are Gin 
485 490 495 

Tvr He Ala Lvs Gin Val Val Phe Tyr Lvs Arg He His Arg Val Phe 

500 505 510 

Phe Val Glu Ala He Pro Lvs Ala Pro Ser Glv Lvs He Leu Arg Lvs 

515 520 525 

Asp Leu Arg Ala Lvs Leu Ala Ser Glv Leu Pro Asn 
530 535 540 



<210>9 
<211> 1544 
<212> DNA 
<213> Pinus taeda 
<400> 9 

aaagataata tatgtgtatg cctactacta cacattgttt tgaagtgtgt aaacatagtg 60 
caacactagg aggactcaca atgagcactt gttgacatga aactagctaa atgcccaaca 120 
atattagtga aagctagtta aactaacccc tttgactttc aagatgatat atttatatcc 180 
ctactacgtc ttcctctttt tgtctttctc ttgtgattaa accttccttg aaacaattct 240 
caaatgtaaa attaaacctt gaaacttgta gagaccaaac ttccctagga gaaaccacat 300 
ttatgacaac atatatacac caacccattg catactataa tattggaatt acctgcagcg 360 
aacgaaagaa acgctgtctc accaactcgt gcactacatc ccgaaactta accttcccct 420 
gatacagatt gaagagccga aaaaagcgtg catccaaatt tctggtatgg tgaggagccg 480 
aaaaacgcgt gcgcctaatt tttttgagat gggccggaaa ataatgcgtg catctaaatt 540 
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ttcacgtgtc gcgtattggc gaggttgcgc tgaatgtgat cctgtgcgtg agccacattc 600 
attccattgg ttgacccgcc ggtaccgcga ggaccgtggg gtctcacaga tacgcggatg 660 
gtggatcagc actgagaaga ttagatgatg accaggcggg catttgaagt aaaaacttgg 720 
gggtggttgg caagtacgcg acaaagaggg gtagtgcgca aggaagcgag ttggatgcaa 780 
ataatattac aaagtgggtt ggtgggcatg agcatcaacc agaatgatgt tgttgctggt 840 
tccgtgcaaa ttctgaccag tagtttgaac aatactaccc aacttgtttt tggtaaaaca 900 
tgaagtgggt aaggagaatt gaacttacgt ctcatggtaa agggcaaggg caaatgactt 960 
aacacatacc tttaactaat aaaaataccc ctaacaaata cgaaaacgaa tgagttatca 1020 
cagaccttca actaataaga tagccatcag acccacatct cctgactgac caaaaacaaa 1080 
tgacttcaac caactaagat acccatcaaa gctaacccac aacccaattc ctcacttccc 1 140 
cttaccagac caaccaagca gacctacgcc attaactact ttaggacgtg ggaattgggg 1200 
gtgccaccgt tgaagaatgg cactcagggt tggtaatccc tccacgtgta tgtagcagtc 1260 
gtttggtgga gacggcgtgt ttgaatgtcc accttccagt ttggagaaca aggaaattgg 1320 
gcttatatta ggcctggatc tcttgtttca gagcaggagt agttcaggac aggaactagc 1380 
attcaagaat tcaattgccc tgccctgctc tgctctgctt tgctcaactt attgatccct 1440 
gctctggttt gttcaatttc ttgacccctg ctgggttctg ctctggtttg cacactttct 1500 

cgattatata agtcattttg gatccttgca aggaagagaa tatg 1544 

<210> 10 
<211>659 
<212> DNA 
<213> Pinus taeda 
<400> 10 

aaacaccaat ttaatgggat ttcagatttg tatcccatgc tattggctaa ggcatttttc 60 
ttattgtaat ctaaccaatt ctaatttcca ccctggtgtg aactgactga caaatgcggt 120 
ccgaaaacag cgaatgaaat gtctgggtga tcggtcaaac aagcggtggg cgagagagcg 180 
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cgggtgttgg cctagccggg atgggggtag gtagacggcg tattaccggc gagttgtccg 240 
aatggagttt tcggggtagg tagtaacgta gacgtcaatg gaaaaagtca taatctccgt 300 
caaaaatcca accgctcctt cacatcgcag agttggtggc cacgggaccc tccacccact 360 
cactcaatcg atcgcctgcc gtggttgccc attattcaac catacgccac ttgactcttc 420 
accaacaatt ccaggccggc tttctataca atgtactgca caggaaaatc caatataaaa 480 
agccggcctc tgcttccttc tcagtagccc ccagctcatt caattcttcc cactgcaggc 540 
tacatttgtc agacacgttt tccgccattt ttcgcctgtt tctgcggaga atttgatcag 600 
gttcggattg ggattgaatc aattgaaagg tttttatttt cagtatttcg atcgccatg 659 
<210> 11 
<211> 2251 
<212> DNA 
<213> Pinus taeda 
<400> 1 1 

ggccgggtgg tgacatttat tcataaattc atctcaaaac aagaaggatt tacaaaaata 60 
aaagaaaaca aaattttcat ctttaacata attataattg tgttcacaaa attcaaactt 120 
aaacccttaa tataaagaat ttctttcaac aatacacttt aatcacaact tcttcaatca 1 80 
caacctcctc caacaaaatt aaaatagatt aataaataaa taaacttaac tatttaaaaa 240 
aaaatattat acaaaaffla ttaaaacttc aaaataaaca aactttttat acaaaattca 300 
tcaaaacttt aaaataaagc taaacactga aaatgtgagt acatttaaaa ggacgctgat 360 
cacaaaaatt ttgaaaacat aaacaaactt gaaactctac cttttaagaa tgagtttgtc 420 
gtctcattaa ctcattagtt ttatagttcg aatccaatta acgtatcttt tattttatgg 480 
aataagggtg ttttaataag tgattttggg atttttttag taatttattt gtgatatgtt 540 
atggaetttt taaaaatata tatatatata tatatttttg ggttgagttt acttaaaatt 600 
tggaaaaggt tggtaagaac tataaattga gttgtgaatg agtgttttat ggatttttta 660 
agatgttaaa tttatatatg taattaaaat tttattttga ataacaaaaa ttataattgg 720 
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ataaaaaatt gttttgttaa atttagagta aaaatttcaa aatctaaaat aattaaacac 780 
tattattttt aaaaaatttg ttggtaaatt ttatcttata tttaagttaa aatttagaaa 840 
aaattaattt taaattaata aacttttgaa gtcaaatatt ccaaatattt tccaaaatat 900 
taaatctatt ttgcattcaa aatacaattt aaataataaa acttcatgga atagattaac 960 
caatttgtat aaaaaccaaa aatctcaaat aaaatttaaa ttacaaaaca ttatcaacat 1020 
tatgatttca agaaagacaa taaccagttt ccaataaaat aaaaaacctc atggcccgta 1080 
attaagatct cattaattaa ttcttatttt ttaatttttt tacatagaaa atatctttat 1 140 
attgtatcca agaaatatag aatgttctcg tccagggact attaatctcc aaa caagttt 1200 
caaaatcatt acattaaagc tcatcatgtc atttgtggat tggaaattat attg tataag 1260 
agaaatatag aatgttctcg tctagggact attaatttcc aaacaaattt caaaatcatt 1320 
acattaaagc tcatcatgtc atttgtggat tggaaattag acaaaaaaaa tcccaaatat 1380 
ttctctcaat ctcccaaaat atagttcgaa ctccatattt ttggaaattg agaatttttt 1440 
tacccaataa tatatttttt tatacatttt agagattttc cagacatatt tgctctggga 1500 
tttattggaa tgaaggttga gttataaact ttcagtaatc caagtatctt cggtttttga 1560 
agatactaaa tccattatat aataaaaaca cattttaaac accaatttaa tgggatttca 1620 
gatttgtatc ccatgctatt ggctaaggca tttttcttat tgtaatctaa ccaattctaa 1680 
tttccaccct ggtgtgaact gactgacaaa tgcggtccga aaacagcgaa tgaaatgtct 1740 
gggtgatcgg tcaaacaagc ggtgggcgag agagcgcggg tgttggccta gccgggatgg 1800 
gggtaggtag acggcgtatt accggcgagt tgtccgaatg gagttttcgg ggtaggtagt 1860 
aacgtagacg tcaatggaaa aagtcataat ctccgtcaaa aatccaaccg ctccttcaca 1920 
tcgcagagtt ggtggccacg ggaccctcca cccactcact cgatcgcctg ccgtggttgc 1980 
ccattattca accatacgcc acttgactct tcaccaacaa ttccaggccg g ctttctata 2040 
caatgtactg cacaggaaaa tccaatataa aaagccggcc tctgcttcct tctcagtag c 2100 
ccccagctca ttcaattctt cccactgcag gctacatttg tcagacacgt tttccgc cat 2160 
ttttcgcctg tttctgcgga gaatttgatc aggttcggat tgggattgaa tca attgaaa 2220 
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ggtttttatt ttcagtattt cgatceccat e 2251 

<210> 12 
<211>20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

peptide 
<400> 12 

Glv Glv Met Ala Thr Tvr Cvs Cvs Ala Thr Thr Tvr Ala Ala Cvs Ala 

1 5 10 15 

Ala Glv Glv Cvs 

20 

<210> 13 
<21 1> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

peptide 
<400> 13 

Ala Ala Ala Glv Ala Glv Ala Glv Asn Ala Cvs Asn Asn Ala Asn Asn 

_J 5 10 15 

Ala Asn Glv Ala 
20 
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<210> 14 
<211>31 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

peptide 
<400> 14 

Thr Thr Glv Glv Ala Thr Cvs Cvs Glv Glv lie Ala Cvs lie Ala Cvs 

_J 5 JO 15 

lie Glv Glv He Tvr Thr lie Cvs Cvs He Ala Ala Are Glv Glv 

20 25 30 

<210> 15 
<211> 31 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

peptide 
<400> 15 

Thr Thr Glv Glv Ala Thr Cvs Cvs Glv Thr He Glv Thr He Glv Cvs 

_J 5 10 I! 

He Cvs Ala Are Cvs Ala Are Glv Thr He Glv Ala Tvr Glv Glv 

20 25 30 

<210> 16 
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<21 1> 27 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

peptide 
<400> 16 

Cvs Cvs He Cvs Thr Tvr Thr Ala Asp Ala Cvs Arg Thr Ala Asp Gly 

_J 5 10 15 

Cvs He Cvs Cvs Ala Glv Cvs Thr Glv Thr Ala 

20 25 

<210> 17 
<211> 15 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 

primer 
<400> 17 

tttttttttt tttta I J 

<210> 18 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Synthetic 

primer 
<400> 18 

tttttttttt ttttc - 15. 

<210> 19 
<211> 15 
<212>DNA 

<2 1 3> Artificial Sequence 
<22Q> 

<223> Description of Artificial Sequence: Synthetic 

primer 
<400> 19 

tttttttttt ttttg 15_ 

<210>20 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

peptide 
<400> 20 

Cvs Cvs Asn Glv Glv Asn Glv Glv Ser Ala Are Gly Ala 

_J 5 10 

<210>21 
<211>9 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

peptide 
<220> 

<221>MOD RES 
<222> (31 

<223> Variable amino acid 
<220> 

<221>MOD RES 

<222> (5)..(6) 

<223> Variable amino acid 

<220> 

<221>MOD RES 
<222> (S) 

<223> Variable amino acid 
<400>21 

Phe Glv Xaa Glv Xaa Xaa Cvs Xaa Glv 

J 5 

<210> 22 
<211>26 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Synthetic 

primer ' 
<220> 

<221> modified base 
<222> (231 
<223> Inosine 
<400> 22 

atgtgcagtt tttttttttt ttnttt 26 

<210>23 
<211>26 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer 
<400> 23 

atggctttcc ttctaatacc catctc 26 

<210> 24 
<211>26 
<212>DNA 

<2 1 3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

primer 
<400> 24 
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gggtgtaatg gacgagcaag gacttg 



26 
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What is claimed is: 

1 . A method for modifying the genome of a gymnosperm which comprises 
cloning one or more angiosperm DNA sequences which code for genes necessary for 
production of angiosperm syringyl lignin monomer units, fusing one or more of the 
angiosperm DNA sequences to a promoter region associated with a gene to form an 
expression cassette and inserting the expression cassette into the gymnosperm to thereby 
produce a modified genome in the gymnosperm containing genes which code for enzymes 
which produce syringyl lignin monomer units. 

2. The method of claim 1, further comprising incorporating a genetic sequence 
which codes for anti-sense mRNA into tile gymnosperm in order to suppress formation of i 
guaiacyl lignin monomer units. 

3. A gymnosperms plant containing an expression cassette produced according 
to the method of claim 1 . 

4. A loblolly pine containing an expression cassette produced according to the 
method of claim 1. 

5. The method of claim 1 wherein the angiosperm DNA sequences are 
selected from the class consisting of 4-coumarate CoA ligase (4CL), bifunctional-O-methyl 
transferase (bi-OMT) and ferulic acid-5-hydroxylase (FA5H-1 and FA5H-2). 

6. The method of claim 1 wherein the promoter region is selected from the 
class consisting of the 5 f flanking region of phenylalanine ammonia-lyase (PAL) and the 5' 
flanking region of 4-coumarate CoA ligase (4CL1B and 4CL3B). 

7. The method of claim 1 wherein the expression cassette is inserted into the 
gymnosperm genome by way of the transformation vector Agrobacterium. 

8. The method of claim 7 wherein the Agrobacterium is Agrobacterium 
tumefaciens EH101. 
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9. The method of claim 1 wherein the expression cassette is inserted into the 
gymnosperm genome via direct DNA delivery to a target cell. 

10. The method of claim 1 wherein expression cassette is inserted into the 
gymnosperm genome by micro-projectile bombardment of a gymnosperm cell. 

1 1 . The method of claim 1 wherein the expression cassette is inserted into the 
gymnosperm genome by electroporation of a gymnosperm cell. 

12. The method of claim 1 wherein the expression cassette is inserted into the 
gymnosperm genome via silicon carbide whiskers. 

13. The method of claim 1 wherein the expression cassette is inserted into the 
gymnosperm genome via transformed protoplast. 

14. The method of claim 1 further comprising inserting a selectable marker 
into the expression cassette. 

15. The method of claim 14 wherein the selectable marker is selected from the 
group consisting of kanamycin and hygromycin B. 

16. The method of claim 2 wherein the anti-sense mRNA is a gymnosperm 
genetic sequence which codes for the 4-coumarate CoA ligase (4CL) gene. 

17. The method of claim 1 wherein the promoter region is a DNA sequence 
which includes the 5' flanking region of the gymnosperm loblolly pine PAL gene. 

18. The method of claim 1 wherein the promoter, region is a DNA sequence 
which includes the 5' flanking region of the gymnosperm loblolly pine 4CL1B gene. 

19. The method of claim 1 wherein the promoter, region is a DNA sequence 
which includes the 5' flanking region of the gymnosperm loblolly pine 4CL3B gene. 

20. The method of claim 1 wherein the promoter region includes a constitutive 

promoter. 
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21. An isolated FA5H-1 DNA sequence which encodes an enzyme involved in 
the biosynthesis of syringyl lignin monomer units, wherein said DNA is as shown in SEQ ID. 
No. 1. 

22. An isolated FA5H-2 DNA sequence which encodes an enzyme involved in 
the biosynthesis of syringyl lignin monomer units, wherein said DNA is as shown in SEQ ID. 
No. 2. 

23. An isolated bi-OMT DNA sequence which encodes an enzyme involved in 
the biosynthesis of syringyl lignin monomer units, wherein said DNA is as shown in SEQ ID. 
No. 3. 

24. An isolated 4CL DNA, sequence which encodes an enzyme involved in 
the biosynthesis of syringyl lignin monomer units, wherein said DNA is as shown in SEQ ID. 
No. 4. 

25. An isolated DNA, wherein said DNA encodes for an enzyme involved in 
the biosynthesis one or more syringyl lignin monomer units. 

26. An isolated DNA sequence which includes the 5' flanking region of the 
gymnosperm loblolly pine PAL gene, containing the lignin promoter region and regulatory 
elements for gymnosperm lignin biosynthesis as shown in SEQ ID No. 5. 

27. An isolated DNA sequence which includes the 5' flanking region of the 
gymnosperm loblolly pine 4C1B, containing the lignin promoter region and regulatory 
elements for gymnosperm lignin biosynthesis as shown in SEQ ID No. 6. 

28. An isolated DNA sequence which includes the 5' flanking region of 
gymnosperm loblolly pine 4CL3H, containing the lignin promoter region and regulatory 
elements for gymnosperm lignin biosynthesis as shown in SEQ ID No. 7. 

29. An isolated DNA, wherein said DNA includes the promoter region of a 
gymnosperm gene involved in syringyl lignin biosynthesis. 
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30. A method for modifying the genome of loblolly pine which comprises 
cloning one or more angiosperm DNA sequences which code for enzymes necessary for 
production of syringyl lignin monomer units, fusing one of more of the angiosperm DNA 
sequences to a promoter region to form an expression cassette, and inserting the expression 
cassette into the loblolly pine genome to thereby produce a modified genome in the loblolly 
pine containing genes which code for enzymes which produce syringyl lignin monomer units. 

3 1 . The method of claim 30 wherein the promoter region is a constitutive 

promoter. 

32. A loblolly pine containing an expression cassette produced according to 

claim 30. 

33. The method of claim 30 wherein the angiosperm DNA sequence is 
selected from the class consisting of 4-coumarate CoA ligase (4CL), bifunctional-O-methyl 
transferase (bi-OMT) and ferulic acid-5 -hydroxylase (FA5H-1 and FA5H-9) 

34. A loblolly pine containing one or more of the DNA sequences of claim 33. 

35. A loblolly pile containing the angiosperm DNA sequence inserted by the 
method of claim 30. 

36. A method for modifying the genome of loblolly pine which comprises 
cloning the sweetgum FA5H-1 gene, fusing it to a constitutive promoter to form an 
expression cassette, and inserting the expression cassette into the loblolly pine genome. 

37. A loblolly pine containing tile FA5H-1 gene. 

38. A method for modifying the genome of loblolly pine which comprises 
cloning the sweetgum FA5H-2 gene, fusing it to a constitutive promoter to form an 
expression cassette, and inserting the expression cassette into the loblolly pine genome. 

39. A loblolly pine containing the FA5H-2 gene. 
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40. A method for modifying the genome of a gymnosperm which comprises 
cloning the sweetgum FA5H-1 gene, fusing it to a constitutive promoter to form an 
expression cassette, and inserting the expression cassette into the loblolly pine genome. 

41 . A method for modifying the genome of a gymnosperm which comprises 
cloning the sweetgum FA5H-2 gene, fusing it to a constitutive promoter to form an 
expression cassette, and inserting the expression cassette into the loblolly pine genome. 

42. A gymnosperm containing the FA5H-1 gene. 

43. A gymnosperm containing the FA5H-2 gene. 

44. A gymnosperm containing a DNA sequence selected from the class 
consisting of the FA5H-1 DNA sequence of SEQ ID No. 1, the FA5H-2 DNA sequence of 
SEQ ID No. 2, the OMT DNA sequence of SEQ ID No. 3, and the 4CL DNA sequences of 
SEQ ID No. 4. 

45. The gymnosperm of claim 38, further comprising syringyl lignin. 
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1 

PRODUCTION OF SYRINGYL LIGNIN 
Abstract 

The present invention relates to a method for producing syringyl lignin in 
gymnosperms. The production of syringyl lignin in gymnosperms is accomplished by 
genetically transforming a gymnosperm genome, which does not normally contain genes 
which code for enzymes necessary for production of syringyl lignin, with DNA which codes 
for enzymes found in angiosperms associated with production of syringyl lignin. The 
expression of the inserted DNA is mediated using host promoter regions in the gymnosperm. 
In addition, genetic sequences which code for gymnosperm lignin anti-sense mRNA may be 
incorporated into the gymnosperm genome in order to suppress the formation of the less 
preferred forms of lignin in the gymnosperm such as guaiacyl lignin 
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